Overview
Brought to you by YData
Dataset statistics
| Number of variables | 64 |
|---|---|
| Number of observations | 601451 |
| Missing cells | 14933142 |
| Missing cells (%) | 38.8% |
| Total size in memory | 293.7 MiB |
| Average record size in memory | 512.0 B |
Variable types
| Text | 64 |
|---|
Dataset
| Description | Mammal NMNH Extant Specimen Records 0054884-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.dys66y |
collectionID has constant value "urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22" | Constant |
collectionCode has constant value "MAMM" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
kingdom has constant value "Animalia" | Constant |
phylum has constant value "Chordata" | Constant |
class has constant value "Mammalia" | Constant |
taxonRank has constant value "subspecies" | Constant |
recordNumber has 50821 (8.4%) missing values | Missing |
recordedBy has 55563 (9.2%) missing values | Missing |
lifeStage has 549447 (91.4%) missing values | Missing |
preparations has 26965 (4.5%) missing values | Missing |
associatedMedia has 45503 (7.6%) missing values | Missing |
associatedSequences has 600397 (99.8%) missing values | Missing |
occurrenceRemarks has 590662 (98.2%) missing values | Missing |
eventDate has 28127 (4.7%) missing values | Missing |
startDayOfYear has 46793 (7.8%) missing values | Missing |
endDayOfYear has 46765 (7.8%) missing values | Missing |
year has 28127 (4.7%) missing values | Missing |
month has 44866 (7.5%) missing values | Missing |
day has 67482 (11.2%) missing values | Missing |
verbatimEventDate has 36490 (6.1%) missing values | Missing |
habitat has 468915 (78.0%) missing values | Missing |
waterBody has 539858 (89.8%) missing values | Missing |
islandGroup has 596682 (99.2%) missing values | Missing |
island has 564842 (93.9%) missing values | Missing |
country has 6532 (1.1%) missing values | Missing |
stateProvince has 93954 (15.6%) missing values | Missing |
county has 447402 (74.4%) missing values | Missing |
locality has 35404 (5.9%) missing values | Missing |
minimumElevationInMeters has 496901 (82.6%) missing values | Missing |
maximumElevationInMeters has 597572 (99.4%) missing values | Missing |
verbatimElevation has 599861 (99.7%) missing values | Missing |
minimumDepthInMeters has 601448 (> 99.9%) missing values | Missing |
decimalLatitude has 448433 (74.6%) missing values | Missing |
decimalLongitude has 448433 (74.6%) missing values | Missing |
geodeticDatum has 594543 (98.9%) missing values | Missing |
verbatimLatitude has 466631 (77.6%) missing values | Missing |
verbatimLongitude has 466723 (77.6%) missing values | Missing |
verbatimCoordinateSystem has 468202 (77.8%) missing values | Missing |
georeferenceProtocol has 592196 (98.5%) missing values | Missing |
georeferenceRemarks has 601383 (> 99.9%) missing values | Missing |
identificationQualifier has 599947 (99.7%) missing values | Missing |
typeStatus has 597685 (99.4%) missing values | Missing |
identifiedBy has 593267 (98.6%) missing values | Missing |
subgenus has 601149 (99.9%) missing values | Missing |
infraspecificEpithet has 314922 (52.4%) missing values | Missing |
taxonRank has 314922 (52.4%) missing values | Missing |
scientificNameAuthorship has 555607 (92.4%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-14 16:48:32.631479 |
|---|---|
| Analysis finished | 2025-01-14 16:48:48.352677 |
| Duration | 15.72 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 601451 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 601451 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1322535732 |
|---|---|
| 2nd row | 1322538146 |
| 3rd row | 1317206206 |
| 4th row | 1317210025 |
| 5th row | 1317210456 |
| Value | Count | Frequency (%) |
| 1322535732 | 1 | < 0.1% |
| 1322555094 | 1 | < 0.1% |
| 1322560018 | 1 | < 0.1% |
| 1322558352 | 1 | < 0.1% |
| 1317224532 | 1 | < 0.1% |
| 4041103536 | 1 | < 0.1% |
| 1317206206 | 1 | < 0.1% |
| 1317210025 | 1 | < 0.1% |
| 1317210456 | 1 | < 0.1% |
| 1317211504 | 1 | < 0.1% |
| Other values (601441) | 601441 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1342473 | |
| 3 | 953825 | |
| 2 | 772027 | |
| 8 | 469400 | 7.8% |
| 9 | 463026 | 7.7% |
| 0 | 459240 | 7.6% |
| 7 | 444579 | 7.4% |
| 4 | 377786 | 6.3% |
| 5 | 367488 | 6.1% |
| 6 | 364666 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6014510 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1342473 | |
| 3 | 953825 | |
| 2 | 772027 | |
| 8 | 469400 | 7.8% |
| 9 | 463026 | 7.7% |
| 0 | 459240 | 7.6% |
| 7 | 444579 | 7.4% |
| 4 | 377786 | 6.3% |
| 5 | 367488 | 6.1% |
| 6 | 364666 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6014510 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1342473 | |
| 3 | 953825 | |
| 2 | 772027 | |
| 8 | 469400 | 7.8% |
| 9 | 463026 | 7.7% |
| 0 | 459240 | 7.6% |
| 7 | 444579 | 7.4% |
| 4 | 377786 | 6.3% |
| 5 | 367488 | 6.1% |
| 6 | 364666 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6014510 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1342473 | |
| 3 | 953825 | |
| 2 | 772027 | |
| 8 | 469400 | 7.8% |
| 9 | 463026 | 7.7% |
| 0 | 459240 | 7.6% |
| 7 | 444579 | 7.4% |
| 4 | 377786 | 6.3% |
| 5 | 367488 | 6.1% |
| 6 | 364666 | 6.1% |
modified
Text
| Distinct | 29672 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 12662 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | 2021-08-09 14:50:00 |
|---|---|
| 2nd row | 2020-04-09 11:54:00 |
| 3rd row | 2020-03-17 10:16:00 |
| 4th row | 2020-05-20 10:50:00 |
| 5th row | 2017-12-08 15:28:00 |
| Value | Count | Frequency (%) |
| 2017-12-08 | 28553 | 2.4% |
| 2021-01-15 | 25810 | 2.1% |
| 2020-07-24 | 12948 | 1.1% |
| 2020-04-09 | 11060 | 0.9% |
| 2020-03-12 | 10837 | 0.9% |
| 2020-04-13 | 9731 | 0.8% |
| 2020-04-14 | 8525 | 0.7% |
| 2020-04-06 | 8277 | 0.7% |
| 2020-03-25 | 8028 | 0.7% |
| 2020-04-02 | 7941 | 0.7% |
| Other values (2209) | 1071192 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3217264 | |
| 2 | 1683114 | |
| 1 | 1412891 | |
| - | 1202902 | 10.5% |
| : | 1202902 | 10.5% |
| 601451 | 5.3% | |
| 4 | 455860 | 4.0% |
| 3 | 439973 | 3.9% |
| 5 | 428273 | 3.7% |
| 9 | 215795 | 1.9% |
| Other values (3) | 567144 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8420314 | |
| Dash Punctuation | 1202902 | 10.5% |
| Other Punctuation | 1202902 | 10.5% |
| Space Separator | 601451 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3217264 | |
| 2 | 1683114 | |
| 1 | 1412891 | |
| 4 | 455860 | 5.4% |
| 3 | 439973 | 5.2% |
| 5 | 428273 | 5.1% |
| 9 | 215795 | 2.6% |
| 6 | 207959 | 2.5% |
| 7 | 187569 | 2.2% |
| 8 | 171616 | 2.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1202902 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1202902 |
Space Separator
| Value | Count | Frequency (%) |
| 601451 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11427569 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3217264 | |
| 2 | 1683114 | |
| 1 | 1412891 | |
| - | 1202902 | 10.5% |
| : | 1202902 | 10.5% |
| 601451 | 5.3% | |
| 4 | 455860 | 4.0% |
| 3 | 439973 | 3.9% |
| 5 | 428273 | 3.7% |
| 9 | 215795 | 1.9% |
| Other values (3) | 567144 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11427569 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3217264 | |
| 2 | 1683114 | |
| 1 | 1412891 | |
| - | 1202902 | 10.5% |
| : | 1202902 | 10.5% |
| 601451 | 5.3% | |
| 4 | 455860 | 4.0% |
| 3 | 439973 | 3.9% |
| 5 | 428273 | 3.7% |
| 9 | 215795 | 1.9% |
| Other values (3) | 567144 | 5.0% |
institutionID
Text
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 28.8108624 |
| Min length | 2 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:34871 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 596967 | |
| nsmt | 977 | 0.2% |
| uam | 775 | 0.1% |
| nrm | 386 | 0.1% |
| rmnh | 354 | 0.1% |
| rcs | 246 | < 0.1% |
| nmv | 238 | < 0.1% |
| nmsz | 188 | < 0.1% |
| zmmu | 179 | < 0.1% |
| fcmm | 127 | < 0.1% |
| Other values (40) | 1015 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2387868 | |
| : | 2387868 | |
| l | 1790901 | 10.3% |
| i | 1193934 | 6.9% |
| r | 1193934 | 6.9% |
| c | 1193934 | 6.9% |
| g | 596967 | 3.4% |
| 7 | 596967 | 3.4% |
| 8 | 596967 | 3.4% |
| 4 | 596967 | 3.4% |
| Other values (31) | 4792015 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11342373 | |
| Other Punctuation | 2984837 | 17.2% |
| Decimal Number | 2984835 | 17.2% |
| Uppercase Letter | 16276 | 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 4384 | |
| N | 2583 | |
| S | 1796 | |
| A | 1319 | 8.1% |
| U | 1175 | 7.2% |
| R | 1035 | 6.4% |
| T | 978 | 6.0% |
| C | 551 | 3.4% |
| H | 550 | 3.4% |
| Z | 467 | 2.9% |
| Other values (11) | 1438 | 8.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2387868 | |
| l | 1790901 | |
| i | 1193934 | |
| r | 1193934 | |
| c | 1193934 | |
| g | 596967 | 5.3% |
| u | 596967 | 5.3% |
| b | 596967 | 5.3% |
| d | 596967 | 5.3% |
| s | 596967 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 596967 | |
| 8 | 596967 | |
| 4 | 596967 | |
| 3 | 596967 | |
| 1 | 596967 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2387868 | |
| . | 596967 | 20.0% |
| ? | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11358649 | |
| Common | 5969673 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2387868 | |
| l | 1790901 | |
| i | 1193934 | |
| r | 1193934 | |
| c | 1193934 | |
| g | 596967 | 5.3% |
| u | 596967 | 5.3% |
| b | 596967 | 5.3% |
| d | 596967 | 5.3% |
| s | 596967 | 5.3% |
| Other values (22) | 613243 | 5.4% |
Common
| Value | Count | Frequency (%) |
| : | 2387868 | |
| 7 | 596967 | 10.0% |
| 8 | 596967 | 10.0% |
| 4 | 596967 | 10.0% |
| 3 | 596967 | 10.0% |
| . | 596967 | 10.0% |
| 1 | 596967 | 10.0% |
| ? | 2 | < 0.1% |
| 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17328322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2387868 | |
| : | 2387868 | |
| l | 1790901 | 10.3% |
| i | 1193934 | 6.9% |
| r | 1193934 | 6.9% |
| c | 1193934 | 6.9% |
| g | 596967 | 3.4% |
| 7 | 596967 | 3.4% |
| 8 | 596967 | 3.4% |
| 4 | 596967 | 3.4% |
| Other values (31) | 4792015 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 |
|---|---|
| 2nd row | urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 |
| 3rd row | urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 |
| 4th row | urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 |
| 5th row | urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 |
| Value | Count | Frequency (%) |
| urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 3007255 | 11.1% |
| - | 2405804 | 8.9% |
| 5 | 2405804 | 8.9% |
| 6 | 1804353 | 6.7% |
| e | 1804353 | 6.7% |
| u | 1804353 | 6.7% |
| d | 1202902 | 4.4% |
| 9 | 1202902 | 4.4% |
| : | 1202902 | 4.4% |
| 1 | 1202902 | 4.4% |
| Other values (12) | 9021765 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13833373 | |
| Lowercase Letter | 9623216 | |
| Dash Punctuation | 2405804 | 8.9% |
| Other Punctuation | 1202902 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 3007255 | |
| 5 | 2405804 | |
| 6 | 1804353 | |
| 9 | 1202902 | 8.7% |
| 1 | 1202902 | 8.7% |
| 4 | 1202902 | 8.7% |
| 2 | 1202902 | 8.7% |
| 0 | 601451 | 4.3% |
| 3 | 601451 | 4.3% |
| 7 | 601451 | 4.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1804353 | |
| u | 1804353 | |
| d | 1202902 | |
| b | 1202902 | |
| i | 601451 | 6.2% |
| a | 601451 | 6.2% |
| r | 601451 | 6.2% |
| n | 601451 | 6.2% |
| c | 601451 | 6.2% |
| f | 601451 | 6.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2405804 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1202902 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17442079 | |
| Latin | 9623216 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 3007255 | |
| - | 2405804 | |
| 5 | 2405804 | |
| 6 | 1804353 | |
| 9 | 1202902 | 6.9% |
| : | 1202902 | 6.9% |
| 1 | 1202902 | 6.9% |
| 4 | 1202902 | 6.9% |
| 2 | 1202902 | 6.9% |
| 0 | 601451 | 3.4% |
| Other values (2) | 1202902 | 6.9% |
Latin
| Value | Count | Frequency (%) |
| e | 1804353 | |
| u | 1804353 | |
| d | 1202902 | |
| b | 1202902 | |
| i | 601451 | 6.2% |
| a | 601451 | 6.2% |
| r | 601451 | 6.2% |
| n | 601451 | 6.2% |
| c | 601451 | 6.2% |
| f | 601451 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27065295 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 3007255 | 11.1% |
| - | 2405804 | 8.9% |
| 5 | 2405804 | 8.9% |
| 6 | 1804353 | 6.7% |
| e | 1804353 | 6.7% |
| u | 1804353 | 6.7% |
| d | 1202902 | 4.4% |
| 9 | 1202902 | 4.4% |
| : | 1202902 | 4.4% |
| 1 | 1202902 | 4.4% |
| Other values (12) | 9021765 |
institutionCode
Text
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 3.997244996 |
| Min length | 2 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 596967 | |
| nsmt | 977 | 0.2% |
| uam | 775 | 0.1% |
| nrm | 386 | 0.1% |
| rmnh | 354 | 0.1% |
| rcs | 246 | < 0.1% |
| nmv | 238 | < 0.1% |
| nmsz | 188 | < 0.1% |
| zmmu | 179 | < 0.1% |
| fcmm | 127 | < 0.1% |
| Other values (40) | 1015 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 601351 | |
| N | 599550 | |
| S | 598763 | |
| U | 598142 | |
| A | 1319 | 0.1% |
| R | 1035 | < 0.1% |
| T | 978 | < 0.1% |
| C | 551 | < 0.1% |
| H | 550 | < 0.1% |
| Z | 467 | < 0.1% |
| Other values (13) | 1441 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2404144 | |
| Other Punctuation | 2 | < 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 601351 | |
| N | 599550 | |
| S | 598763 | |
| U | 598142 | |
| A | 1319 | 0.1% |
| R | 1035 | < 0.1% |
| T | 978 | < 0.1% |
| C | 551 | < 0.1% |
| H | 550 | < 0.1% |
| Z | 467 | < 0.1% |
| Other values (11) | 1438 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2404144 | |
| Common | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 601351 | |
| N | 599550 | |
| S | 598763 | |
| U | 598142 | |
| A | 1319 | 0.1% |
| R | 1035 | < 0.1% |
| T | 978 | < 0.1% |
| C | 551 | < 0.1% |
| H | 550 | < 0.1% |
| Z | 467 | < 0.1% |
| Other values (11) | 1438 | 0.1% |
Common
| Value | Count | Frequency (%) |
| ? | 2 | |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2404147 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 601351 | |
| N | 599550 | |
| S | 598763 | |
| U | 598142 | |
| A | 1319 | 0.1% |
| R | 1035 | < 0.1% |
| T | 978 | < 0.1% |
| C | 551 | < 0.1% |
| H | 550 | < 0.1% |
| Z | 467 | < 0.1% |
| Other values (13) | 1441 | 0.1% |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MAMM |
|---|---|
| 2nd row | MAMM |
| 3rd row | MAMM |
| 4th row | MAMM |
| 5th row | MAMM |
| Value | Count | Frequency (%) |
| mamm | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1804353 | |
| A | 601451 | 25.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2405804 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1804353 | |
| A | 601451 | 25.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2405804 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 1804353 | |
| A | 601451 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2405804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 1804353 | |
| A | 601451 | 25.0% |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 601451 | |
| extant | 601451 | |
| biology | 601451 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1202902 | 10.5% |
| 1202902 | 10.5% | |
| t | 1202902 | 10.5% |
| o | 1202902 | 10.5% |
| M | 601451 | 5.3% |
| H | 601451 | 5.3% |
| E | 601451 | 5.3% |
| x | 601451 | 5.3% |
| a | 601451 | 5.3% |
| n | 601451 | 5.3% |
| Other values (5) | 3007255 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6615961 | |
| Uppercase Letter | 3608706 | |
| Space Separator | 1202902 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1202902 | |
| o | 1202902 | |
| x | 601451 | |
| a | 601451 | |
| n | 601451 | |
| i | 601451 | |
| l | 601451 | |
| g | 601451 | |
| y | 601451 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1202902 | |
| M | 601451 | |
| H | 601451 | |
| E | 601451 | |
| B | 601451 |
Space Separator
| Value | Count | Frequency (%) |
| 1202902 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10224667 | |
| Common | 1202902 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1202902 | |
| t | 1202902 | |
| o | 1202902 | |
| M | 601451 | 5.9% |
| H | 601451 | 5.9% |
| E | 601451 | 5.9% |
| x | 601451 | 5.9% |
| a | 601451 | 5.9% |
| n | 601451 | 5.9% |
| B | 601451 | 5.9% |
| Other values (4) | 2405804 |
Common
| Value | Count | Frequency (%) |
| 1202902 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11427569 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1202902 | 10.5% |
| 1202902 | 10.5% | |
| t | 1202902 | 10.5% |
| o | 1202902 | 10.5% |
| M | 601451 | 5.3% |
| H | 601451 | 5.3% |
| E | 601451 | 5.3% |
| x | 601451 | 5.3% |
| a | 601451 | 5.3% |
| n | 601451 | 5.3% |
| Other values (5) | 3007255 |
basisOfRecord
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 16.95205428 |
| Min length | 16 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PreservedSpecimen |
|---|---|
| 2nd row | PreservedSpecimen |
| 3rd row | PreservedSpecimen |
| 4th row | PreservedSpecimen |
| 5th row | HumanObservation |
| Value | Count | Frequency (%) |
| preservedspecimen | 572614 | |
| humanobservation | 28837 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2891907 | |
| r | 1174065 | |
| n | 630288 | 6.2% |
| i | 601451 | 5.9% |
| s | 601451 | 5.9% |
| v | 601451 | 5.9% |
| m | 601451 | 5.9% |
| c | 572614 | 5.6% |
| P | 572614 | 5.6% |
| p | 572614 | 5.6% |
| Other values (9) | 1375924 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8992928 | |
| Uppercase Letter | 1202902 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2891907 | |
| r | 1174065 | |
| n | 630288 | 7.0% |
| i | 601451 | 6.7% |
| s | 601451 | 6.7% |
| v | 601451 | 6.7% |
| m | 601451 | 6.7% |
| c | 572614 | 6.4% |
| p | 572614 | 6.4% |
| d | 572614 | 6.4% |
| Other values (5) | 173022 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 572614 | |
| S | 572614 | |
| H | 28837 | 2.4% |
| O | 28837 | 2.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10195830 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2891907 | |
| r | 1174065 | |
| n | 630288 | 6.2% |
| i | 601451 | 5.9% |
| s | 601451 | 5.9% |
| v | 601451 | 5.9% |
| m | 601451 | 5.9% |
| c | 572614 | 5.6% |
| P | 572614 | 5.6% |
| p | 572614 | 5.6% |
| Other values (9) | 1375924 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10195830 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2891907 | |
| r | 1174065 | |
| n | 630288 | 6.2% |
| i | 601451 | 5.9% |
| s | 601451 | 5.9% |
| v | 601451 | 5.9% |
| m | 601451 | 5.9% |
| c | 572614 | 5.6% |
| P | 572614 | 5.6% |
| p | 572614 | 5.6% |
| Other values (9) | 1375924 |
occurrenceID
Text
Unique 
| Distinct | 601451 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 601451 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/3ebec6a7f-5e95-4543-b061-6d73d80dd2ee |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/3ec070d5d-1893-4600-afa5-e56695ff219b |
| 3rd row | http://n2t.net/ark:/65665/3002acaf9-9788-4539-8883-fe6bfd5f8d88 |
| 4th row | http://n2t.net/ark:/65665/300553499-1544-460e-9507-55ada241f992 |
| 5th row | http://n2t.net/ark:/65665/3005a3503-9c20-443c-899a-559e550dc71e |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/3ebec6a7f-5e95-4543-b061-6d73d80dd2ee | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ecc76d35-e5c5-434e-874b-88c5d85dbb91 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ecff6276-27d1-4ad7-aac3-32c485b9bed6 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3eceb4d85-2fbe-4bf2-aef7-b3393445f319 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300f96572-4f6d-48dc-9b78-1ba0e03bb0ae | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec5d68e1-4786-40d2-9bdb-bb8ef2ad056d | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3002acaf9-9788-4539-8883-fe6bfd5f8d88 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300553499-1544-460e-9507-55ada241f992 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3005a3503-9c20-443c-899a-559e550dc71e | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300664e6c-5334-4a8e-b9a7-4d84389595e0 | 1 | < 0.1% |
| Other values (601441) | 601441 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 3007255 | 7.9% |
| 6 | 2930823 | 7.7% |
| - | 2405804 | 6.3% |
| t | 2405804 | 6.3% |
| 5 | 2330760 | 6.2% |
| a | 1878835 | 5.0% |
| e | 1729856 | 4.6% |
| 2 | 1729289 | 4.6% |
| 3 | 1728046 | 4.6% |
| 4 | 1727823 | 4.6% |
| Other values (16) | 16017118 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16387822 | |
| Lowercase Letter | 14286179 | |
| Other Punctuation | 4811608 | 12.7% |
| Dash Punctuation | 2405804 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2405804 | |
| a | 1878835 | |
| e | 1729856 | |
| b | 1278851 | |
| n | 1202902 | |
| f | 1128774 | |
| c | 1128212 | |
| d | 1127141 | |
| k | 601451 | 4.2% |
| r | 601451 | 4.2% |
| Other values (2) | 1202902 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2930823 | |
| 5 | 2330760 | |
| 2 | 1729289 | |
| 3 | 1728046 | |
| 4 | 1727823 | |
| 9 | 1279292 | |
| 8 | 1278534 | |
| 0 | 1129193 | 6.9% |
| 7 | 1127612 | 6.9% |
| 1 | 1126450 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3007255 | |
| : | 1202902 | 25.0% |
| . | 601451 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2405804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23605234 | |
| Latin | 14286179 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 3007255 | |
| 6 | 2930823 | |
| - | 2405804 | |
| 5 | 2330760 | |
| 2 | 1729289 | |
| 3 | 1728046 | |
| 4 | 1727823 | |
| 9 | 1279292 | 5.4% |
| 8 | 1278534 | 5.4% |
| : | 1202902 | 5.1% |
| Other values (4) | 3984706 |
Latin
| Value | Count | Frequency (%) |
| t | 2405804 | |
| a | 1878835 | |
| e | 1729856 | |
| b | 1278851 | |
| n | 1202902 | |
| f | 1128774 | |
| c | 1128212 | |
| d | 1127141 | |
| k | 601451 | 4.2% |
| r | 601451 | 4.2% |
| Other values (2) | 1202902 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37891413 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 3007255 | 7.9% |
| 6 | 2930823 | 7.7% |
| - | 2405804 | 6.3% |
| t | 2405804 | 6.3% |
| 5 | 2330760 | 6.2% |
| a | 1878835 | 5.0% |
| e | 1729856 | 4.6% |
| 2 | 1729289 | 4.6% |
| 3 | 1728046 | 4.6% |
| 4 | 1727823 | 4.6% |
| Other values (16) | 16017118 |
catalogNumber
Text
| Distinct | 601428 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 10.92069179 |
| Min length | 4 |
Unique
| Unique | 601407 ? |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | USNM 449558 |
|---|---|
| 2nd row | USNM 226903 |
| 3rd row | USNM 386480 |
| 4th row | USNM 68620 |
| 5th row | USNM MME9342 |
| Value | Count | Frequency (%) |
| usnm | 596967 | |
| wam | 63 | < 0.1% |
| mb | 40 | < 0.1% |
| zin | 21 | < 0.1% |
| lacm | 18 | < 0.1% |
| nsmt | 12 | < 0.1% |
| sama | 6 | < 0.1% |
| zmmu | 5 | < 0.1% |
| rmnh | 4 | < 0.1% |
| ncsm | 4 | < 0.1% |
| Other values (601439) | 601471 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 627122 | |
| S | 616877 | 9.4% |
| N | 601401 | 9.2% |
| U | 598144 | 9.1% |
| 597160 | 9.1% | |
| 1 | 405808 | 6.2% |
| 2 | 403390 | 6.1% |
| 3 | 394478 | 6.0% |
| 5 | 393693 | 6.0% |
| 4 | 379861 | 5.8% |
| Other values (25) | 1550327 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3465081 | |
| Uppercase Letter | 2506018 | |
| Space Separator | 597160 | 9.1% |
| Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 627122 | |
| S | 616877 | |
| N | 601401 | |
| U | 598144 | |
| R | 17298 | 0.7% |
| T | 17251 | 0.7% |
| E | 14721 | 0.6% |
| A | 10176 | 0.4% |
| C | 553 | < 0.1% |
| H | 550 | < 0.1% |
| Other values (13) | 1925 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 405808 | |
| 2 | 403390 | |
| 3 | 394478 | |
| 5 | 393693 | |
| 4 | 379861 | |
| 6 | 309193 | |
| 7 | 297996 | |
| 0 | 295420 | |
| 8 | 295286 | |
| 9 | 289956 |
Space Separator
| Value | Count | Frequency (%) |
| 597160 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4062243 | |
| Latin | 2506018 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 627122 | |
| S | 616877 | |
| N | 601401 | |
| U | 598144 | |
| R | 17298 | 0.7% |
| T | 17251 | 0.7% |
| E | 14721 | 0.6% |
| A | 10176 | 0.4% |
| C | 553 | < 0.1% |
| H | 550 | < 0.1% |
| Other values (13) | 1925 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 597160 | ||
| 1 | 405808 | |
| 2 | 403390 | |
| 3 | 394478 | |
| 5 | 393693 | |
| 4 | 379861 | |
| 6 | 309193 | |
| 7 | 297996 | |
| 0 | 295420 | |
| 8 | 295286 | |
| Other values (2) | 289958 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6568261 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 627122 | |
| S | 616877 | 9.4% |
| N | 601401 | 9.2% |
| U | 598144 | 9.1% |
| 597160 | 9.1% | |
| 1 | 405808 | 6.2% |
| 2 | 403390 | 6.1% |
| 3 | 394478 | 6.0% |
| 5 | 393693 | 6.0% |
| 4 | 379861 | 5.8% |
| Other values (25) | 1550327 |
recordNumber
Text
Missing 
| Distinct | 172937 |
|---|---|
| Distinct (%) | 31.4% |
| Missing | 50821 |
| Missing (%) | 8.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 28 |
| Mean length | 5.176632221 |
| Min length | 1 |
Unique
| Unique | 147848 ? |
|---|---|
| Unique (%) | 26.9% |
Sample
| 1st row | FMG 2371 |
|---|---|
| 2nd row | 142/19534X |
| 3rd row | 07960 |
| 4th row | 6459 |
| 5th row | B47586/R50468 |
| Value | Count | Frequency (%) |
| no | 47434 | 6.9% |
| number | 47222 | 6.9% |
| cohjr | 5988 | 0.9% |
| nzp | 3372 | 0.5% |
| psc | 2713 | 0.4% |
| jwk | 2021 | 0.3% |
| r | 1947 | 0.3% |
| fm | 1793 | 0.3% |
| jjg | 1781 | 0.3% |
| rem | 1569 | 0.2% |
| Other values (105383) | 570874 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 307242 | 10.8% |
| 2 | 246234 | 8.6% |
| 3 | 208467 | 7.3% |
| 4 | 190900 | 6.7% |
| 0 | 182605 | 6.4% |
| 5 | 181877 | 6.4% |
| 6 | 173588 | 6.1% |
| 7 | 165796 | 5.8% |
| 8 | 159989 | 5.6% |
| 9 | 153227 | 5.4% |
| Other values (69) | 880484 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1969925 | |
| Uppercase Letter | 409557 | 14.4% |
| Lowercase Letter | 285569 | 10.0% |
| Space Separator | 136084 | 4.8% |
| Other Punctuation | 26739 | 0.9% |
| Dash Punctuation | 20734 | 0.7% |
| Close Punctuation | 888 | < 0.1% |
| Open Punctuation | 886 | < 0.1% |
| Currency Symbol | 13 | < 0.1% |
| Math Symbol | 10 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 106292 | |
| R | 28947 | 7.1% |
| M | 24702 | 6.0% |
| J | 23837 | 5.8% |
| C | 21743 | 5.3% |
| H | 19696 | 4.8% |
| X | 17857 | 4.4% |
| B | 15635 | 3.8% |
| P | 15412 | 3.8% |
| E | 14048 | 3.4% |
| Other values (16) | 121388 |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 47347 | |
| e | 47325 | |
| o | 47216 | |
| m | 47180 | |
| u | 47177 | |
| b | 47174 | |
| n | 1310 | 0.5% |
| a | 152 | 0.1% |
| p | 115 | < 0.1% |
| i | 108 | < 0.1% |
| Other values (13) | 465 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 307242 | |
| 2 | 246234 | |
| 3 | 208467 | |
| 4 | 190900 | |
| 0 | 182605 | |
| 5 | 181877 | |
| 6 | 173588 | |
| 7 | 165796 | |
| 8 | 159989 | |
| 9 | 153227 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 23475 | |
| . | 2050 | 7.7% |
| , | 626 | 2.3% |
| # | 248 | 0.9% |
| ? | 202 | 0.8% |
| & | 47 | 0.2% |
| ; | 44 | 0.2% |
| : | 22 | 0.1% |
| * | 21 | 0.1% |
| ' | 4 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 887 | |
| ] | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 885 | |
| [ | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 6 | |
| + | 4 |
Space Separator
| Value | Count | Frequency (%) |
| 136084 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20734 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 13 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2155283 | |
| Latin | 695126 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 106292 | |
| r | 47347 | 6.8% |
| e | 47325 | 6.8% |
| o | 47216 | 6.8% |
| m | 47180 | 6.8% |
| u | 47177 | 6.8% |
| b | 47174 | 6.8% |
| R | 28947 | 4.2% |
| M | 24702 | 3.6% |
| J | 23837 | 3.4% |
| Other values (39) | 227929 |
Common
| Value | Count | Frequency (%) |
| 1 | 307242 | |
| 2 | 246234 | |
| 3 | 208467 | |
| 4 | 190900 | |
| 0 | 182605 | |
| 5 | 181877 | |
| 6 | 173588 | |
| 7 | 165796 | |
| 8 | 159989 | |
| 9 | 153227 | |
| Other values (20) | 185358 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2850396 | |
| None | 13 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 307242 | 10.8% |
| 2 | 246234 | 8.6% |
| 3 | 208467 | 7.3% |
| 4 | 190900 | 6.7% |
| 0 | 182605 | 6.4% |
| 5 | 181877 | 6.4% |
| 6 | 173588 | 6.1% |
| 7 | 165796 | 5.8% |
| 8 | 159989 | 5.6% |
| 9 | 153227 | 5.4% |
| Other values (68) | 880471 |
None
| Value | Count | Frequency (%) |
| ¢ | 13 |
recordedBy
Text
Missing 
| Distinct | 17644 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 55563 |
| Missing (%) | 9.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 124 |
|---|---|
| Median length | 114 |
| Mean length | 11.92282483 |
| Min length | 1 |
Unique
| Unique | 9079 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | F. Greenwell |
|---|---|
| 2nd row | J. Silver |
| 3rd row | Smithsonian Venezuelan Project |
| 4th row | Nelson & E. Goldman |
| 5th row | W. Bowen & V. Thayer |
| Value | Count | Frequency (%) |
| j | 60783 | 4.7% |
| e | 54366 | 4.2% |
| c | 53496 | 4.2% |
| 50457 | 3.9% | |
| r | 49868 | 3.9% |
| a | 44074 | 3.4% |
| w | 37880 | 2.9% |
| h | 30720 | 2.4% |
| d | 24753 | 1.9% |
| m | 23831 | 1.9% |
| Other values (10447) | 856734 |
Most occurring characters
| Value | Count | Frequency (%) |
| 741074 | 11.4% | |
| e | 563544 | 8.7% |
| . | 539103 | 8.3% |
| n | 389678 | 6.0% |
| a | 341353 | 5.2% |
| o | 335107 | 5.1% |
| r | 327053 | 5.0% |
| l | 295446 | 4.5% |
| i | 245022 | 3.8% |
| s | 228632 | 3.5% |
| Other values (70) | 2502515 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3897970 | |
| Uppercase Letter | 1254996 | 19.3% |
| Space Separator | 741074 | 11.4% |
| Other Punctuation | 599060 | 9.2% |
| Close Punctuation | 5447 | 0.1% |
| Open Punctuation | 5376 | 0.1% |
| Dash Punctuation | 2452 | < 0.1% |
| Decimal Number | 2151 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 563544 | |
| n | 389678 | |
| a | 341353 | |
| o | 335107 | |
| r | 327053 | 8.4% |
| l | 295446 | 7.6% |
| i | 245022 | 6.3% |
| s | 228632 | 5.9% |
| t | 223935 | 5.7% |
| h | 116266 | 3.0% |
| Other values (18) | 831934 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 91216 | 7.3% |
| M | 88625 | 7.1% |
| C | 87417 | 7.0% |
| S | 86724 | 6.9% |
| H | 84189 | 6.7% |
| G | 82831 | 6.6% |
| J | 76177 | 6.1% |
| A | 70972 | 5.7% |
| E | 64988 | 5.2% |
| P | 62861 | 5.0% |
| Other values (16) | 458996 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 539103 | |
| & | 50656 | 8.5% |
| , | 8029 | 1.3% |
| ' | 1002 | 0.2% |
| / | 114 | < 0.1% |
| : | 78 | < 0.1% |
| ? | 29 | < 0.1% |
| " | 26 | < 0.1% |
| ; | 13 | < 0.1% |
| # | 10 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1561 | |
| 8 | 243 | 11.3% |
| 2 | 219 | 10.2% |
| 4 | 34 | 1.6% |
| 6 | 33 | 1.5% |
| 0 | 31 | 1.4% |
| 9 | 12 | 0.6% |
| 5 | 8 | 0.4% |
| 3 | 7 | 0.3% |
| 7 | 3 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5375 | |
| [ | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 741074 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5447 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2452 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5152966 | |
| Common | 1355561 | 20.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 563544 | 10.9% |
| n | 389678 | 7.6% |
| a | 341353 | 6.6% |
| o | 335107 | 6.5% |
| r | 327053 | 6.3% |
| l | 295446 | 5.7% |
| i | 245022 | 4.8% |
| s | 228632 | 4.4% |
| t | 223935 | 4.3% |
| h | 116266 | 2.3% |
| Other values (44) | 2086930 |
Common
| Value | Count | Frequency (%) |
| 741074 | ||
| . | 539103 | |
| & | 50656 | 3.7% |
| , | 8029 | 0.6% |
| ) | 5447 | 0.4% |
| ( | 5375 | 0.4% |
| - | 2452 | 0.2% |
| 1 | 1561 | 0.1% |
| ' | 1002 | 0.1% |
| 8 | 243 | < 0.1% |
| Other values (16) | 619 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6508521 | |
| None | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 741074 | 11.4% | |
| e | 563544 | 8.7% |
| . | 539103 | 8.3% |
| n | 389678 | 6.0% |
| a | 341353 | 5.2% |
| o | 335107 | 5.1% |
| r | 327053 | 5.0% |
| l | 295446 | 4.5% |
| i | 245022 | 3.8% |
| s | 228632 | 3.5% |
| Other values (68) | 2502509 |
None
| Value | Count | Frequency (%) |
| ç | 3 | |
| ā | 3 |
individualCount
Text
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 44 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 1 |
| Mean length | 1.000033255 |
| Min length | 1 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 601314 | |
| 2 | 45 | < 0.1% |
| 6 | 8 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 6 | < 0.1% |
| 7 | 5 | < 0.1% |
| 5 | 4 | < 0.1% |
| 271 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
| 20 | 2 | < 0.1% |
| Other values (11) | 11 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 601326 | |
| 2 | 51 | < 0.1% |
| 6 | 9 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 9 | < 0.1% |
| 7 | 8 | < 0.1% |
| 0 | 7 | < 0.1% |
| 5 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 601427 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 601326 | |
| 2 | 51 | < 0.1% |
| 6 | 9 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 9 | < 0.1% |
| 7 | 8 | < 0.1% |
| 0 | 7 | < 0.1% |
| 5 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 601427 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 601326 | |
| 2 | 51 | < 0.1% |
| 6 | 9 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 9 | < 0.1% |
| 7 | 8 | < 0.1% |
| 0 | 7 | < 0.1% |
| 5 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 601427 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 601326 | |
| 2 | 51 | < 0.1% |
| 6 | 9 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 9 | < 0.1% |
| 7 | 8 | < 0.1% |
| 0 | 7 | < 0.1% |
| 5 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
sex
Text
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 21 |
| Mean length | 5.271076114 |
| Min length | 1 |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Male |
| 3rd row | Male |
| 4th row | Female |
| 5th row | Female |
| Value | Count | Frequency (%) |
| male | 266476 | |
| female | 246781 | |
| unknown | 87925 | 14.6% |
| multiple | 279 | < 0.1% |
| animals | 279 | < 0.1% |
| of | 279 | < 0.1% |
| mixed | 279 | < 0.1% |
| sex | 279 | < 0.1% |
| 12 | < 0.1% | |
| f | 5 | < 0.1% |
| Other values (6) | 10 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 760879 | |
| l | 514098 | |
| a | 513820 | |
| M | 266760 | 8.4% |
| n | 264058 | 8.3% |
| m | 247341 | 7.8% |
| F | 246786 | 7.8% |
| o | 88205 | 2.8% |
| w | 87927 | 2.8% |
| U | 87926 | 2.8% |
| Other values (17) | 92494 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2567611 | |
| Uppercase Letter | 601473 | 19.0% |
| Space Separator | 1153 | < 0.1% |
| Other Punctuation | 57 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 760879 | |
| l | 514098 | |
| a | 513820 | |
| n | 264058 | 10.3% |
| m | 247341 | 9.6% |
| o | 88205 | 3.4% |
| w | 87927 | 3.4% |
| k | 87926 | 3.4% |
| i | 839 | < 0.1% |
| s | 558 | < 0.1% |
| Other values (9) | 1960 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 266760 | |
| F | 246786 | |
| U | 87926 | 14.6% |
| P | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 41 | |
| ? | 15 | 26.3% |
| / | 1 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 1153 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3169084 | |
| Common | 1210 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 760879 | |
| l | 514098 | |
| a | 513820 | |
| M | 266760 | 8.4% |
| n | 264058 | 8.3% |
| m | 247341 | 7.8% |
| F | 246786 | 7.8% |
| o | 88205 | 2.8% |
| w | 87927 | 2.8% |
| U | 87926 | 2.8% |
| Other values (13) | 91284 | 2.9% |
Common
| Value | Count | Frequency (%) |
| 1153 | ||
| ; | 41 | 3.4% |
| ? | 15 | 1.2% |
| / | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3170294 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 760879 | |
| l | 514098 | |
| a | 513820 | |
| M | 266760 | 8.4% |
| n | 264058 | 8.3% |
| m | 247341 | 7.8% |
| F | 246786 | 7.8% |
| o | 88205 | 2.8% |
| w | 87927 | 2.8% |
| U | 87926 | 2.8% |
| Other values (17) | 92494 | 2.9% |
lifeStage
Text
Missing 
| Distinct | 91 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 549447 |
| Missing (%) | 91.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 5 |
| Mean length | 6.100703792 |
| Min length | 3 |
Unique
| Unique | 26 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Juvenile |
| 4th row | Juvenile |
| 5th row | Adult |
| Value | Count | Frequency (%) |
| adult | 31151 | |
| juvenile | 9861 | 18.6% |
| immature | 3907 | 7.4% |
| subadult | 2173 | 4.1% |
| young | 1853 | 3.5% |
| embryo | 837 | 1.6% |
| fetus | 684 | 1.3% |
| old | 511 | 1.0% |
| nestling | 499 | 0.9% |
| neonate | 453 | 0.9% |
| Other values (55) | 1019 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 52076 | |
| l | 44573 | |
| t | 39354 | |
| d | 33936 | |
| A | 30440 | |
| e | 26241 | |
| n | 13312 | 4.2% |
| i | 10584 | 3.3% |
| v | 9888 | 3.1% |
| J | 9846 | 3.1% |
| Other values (35) | 47011 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 264129 | |
| Uppercase Letter | 51998 | 16.4% |
| Space Separator | 944 | 0.3% |
| Other Punctuation | 173 | 0.1% |
| Dash Punctuation | 17 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 52076 | |
| l | 44573 | |
| t | 39354 | |
| d | 33936 | |
| e | 26241 | |
| n | 13312 | 5.0% |
| i | 10584 | 4.0% |
| v | 9888 | 3.7% |
| m | 8822 | 3.3% |
| a | 7788 | 2.9% |
| Other values (13) | 17555 | 6.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 30440 | |
| J | 9846 | 18.9% |
| I | 3944 | 7.6% |
| S | 2221 | 4.3% |
| Y | 1917 | 3.7% |
| E | 998 | 1.9% |
| N | 995 | 1.9% |
| F | 707 | 1.4% |
| O | 511 | 1.0% |
| P | 154 | 0.3% |
| Other values (7) | 265 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 108 | |
| / | 46 | |
| ; | 19 | 11.0% |
Space Separator
| Value | Count | Frequency (%) |
| 944 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 316127 | |
| Common | 1134 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 52076 | |
| l | 44573 | |
| t | 39354 | |
| d | 33936 | |
| A | 30440 | |
| e | 26241 | |
| n | 13312 | 4.2% |
| i | 10584 | 3.3% |
| v | 9888 | 3.1% |
| J | 9846 | 3.1% |
| Other values (30) | 45877 |
Common
| Value | Count | Frequency (%) |
| 944 | ||
| ? | 108 | 9.5% |
| / | 46 | 4.1% |
| ; | 19 | 1.7% |
| - | 17 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 317261 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 52076 | |
| l | 44573 | |
| t | 39354 | |
| d | 33936 | |
| A | 30440 | |
| e | 26241 | |
| n | 13312 | 4.2% |
| i | 10584 | 3.3% |
| v | 9888 | 3.1% |
| J | 9846 | 3.1% |
| Other values (35) | 47011 |
preparations
Text
Missing 
| Distinct | 542 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 26965 |
| Missing (%) | 4.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 73 |
|---|---|
| Median length | 11 |
| Mean length | 10.02423558 |
| Min length | 4 |
Unique
| Unique | 248 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Skin; Skull |
|---|---|
| 2nd row | Skin; Skull |
| 3rd row | Skin; Skull |
| 4th row | Skin; Skull |
| 5th row | Skin; Skull |
| Value | Count | Frequency (%) |
| skull | 452764 | |
| skin | 367609 | |
| fluid | 101452 | 10.0% |
| skeleton | 36584 | 3.6% |
| partial | 10316 | 1.0% |
| in | 8642 | 0.9% |
| remainder | 8641 | 0.9% |
| anatomical | 5878 | 0.6% |
| baculum/baubellum | 3372 | 0.3% |
| baleen | 2349 | 0.2% |
| Other values (42) | 14726 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1076304 | |
| k | 859539 | |
| S | 856659 | |
| u | 570461 | |
| i | 506031 | |
| 437847 | ||
| n | 435543 | |
| ; | 404417 | 7.0% |
| d | 111124 | 1.9% |
| e | 103346 | 1.8% |
| Other values (39) | 397512 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3909067 | |
| Uppercase Letter | 1004072 | 17.4% |
| Space Separator | 437847 | 7.6% |
| Other Punctuation | 407794 | 7.1% |
| Decimal Number | 2 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1076304 | |
| k | 859539 | |
| u | 570461 | |
| i | 506031 | |
| n | 435543 | |
| d | 111124 | 2.8% |
| e | 103346 | 2.6% |
| t | 60548 | 1.5% |
| o | 55332 | 1.4% |
| a | 53911 | 1.4% |
| Other values (15) | 76928 | 2.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 856659 | |
| F | 101451 | 10.1% |
| P | 11688 | 1.2% |
| B | 9093 | 0.9% |
| R | 8650 | 0.9% |
| A | 6797 | 0.7% |
| T | 3295 | 0.3% |
| H | 2684 | 0.3% |
| O | 1310 | 0.1% |
| M | 940 | 0.1% |
| Other values (6) | 1505 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 404417 | |
| / | 3372 | 0.8% |
| , | 4 | < 0.1% |
| . | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 6 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 437847 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4913139 | |
| Common | 845644 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1076304 | |
| k | 859539 | |
| S | 856659 | |
| u | 570461 | |
| i | 506031 | |
| n | 435543 | |
| d | 111124 | 2.3% |
| e | 103346 | 2.1% |
| F | 101451 | 2.1% |
| t | 60548 | 1.2% |
| Other values (31) | 232133 | 4.7% |
Common
| Value | Count | Frequency (%) |
| 437847 | ||
| ; | 404417 | |
| / | 3372 | 0.4% |
| , | 4 | < 0.1% |
| 5 | 1 | < 0.1% |
| . | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| + | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5758783 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 1076304 | |
| k | 859539 | |
| S | 856659 | |
| u | 570461 | |
| i | 506031 | |
| 437847 | ||
| n | 435543 | |
| ; | 404417 | 7.0% |
| d | 111124 | 1.9% |
| e | 103346 | 1.8% |
| Other values (39) | 397512 | 6.9% |
associatedMedia
Text
Missing 
| Distinct | 48707 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 45503 |
| Missing (%) | 7.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 1263 |
|---|---|
| Median length | 49 |
| Mean length | 50.56994719 |
| Min length | 48 |
Unique
| Unique | 15254 ? |
|---|---|
| Unique (%) | 2.7% |
Sample
| 1st row | https://collections.nmnh.si.edu/media/?i=14431681 |
|---|---|
| 2nd row | https://collections.nmnh.si.edu/media/?i=14603706 |
| 3rd row | https://collections.nmnh.si.edu/media/?i=14483098 |
| 4th row | https://collections.nmnh.si.edu/media/?i=14780717 |
| 5th row | https://collections.nmnh.si.edu/media/?i=14572646 |
| Value | Count | Frequency (%) |
| 14887746 | 84 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=14563406 | 60 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=14561922 | 50 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=14561911 | 50 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=14561967 | 50 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=14561909 | 50 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=14561974 | 50 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=14561968 | 50 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=14561972 | 50 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=14561943 | 50 | < 0.1% |
| Other values (81691) | 643161 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2223792 | 7.9% |
| / | 2223792 | 7.9% |
| t | 1667844 | 5.9% |
| s | 1667844 | 5.9% |
| . | 1667844 | 5.9% |
| n | 1667844 | 5.9% |
| e | 1667844 | 5.9% |
| h | 1111896 | 4.0% |
| d | 1111896 | 4.0% |
| m | 1111896 | 4.0% |
| Other values (21) | 11991769 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17234388 | |
| Decimal Number | 5144879 | 18.3% |
| Other Punctuation | 5091289 | 18.1% |
| Math Symbol | 555948 | 2.0% |
| Space Separator | 87757 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2223792 | |
| t | 1667844 | |
| s | 1667844 | |
| n | 1667844 | |
| e | 1667844 | |
| h | 1111896 | 6.5% |
| d | 1111896 | 6.5% |
| m | 1111896 | 6.5% |
| l | 1111896 | 6.5% |
| o | 1111896 | 6.5% |
| Other values (4) | 2779740 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1029241 | |
| 4 | 925235 | |
| 6 | 489208 | |
| 7 | 463799 | |
| 5 | 412817 | |
| 0 | 384669 | 7.5% |
| 8 | 371873 | 7.2% |
| 3 | 367486 | 7.1% |
| 9 | 358952 | 7.0% |
| 2 | 341599 | 6.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2223792 | |
| . | 1667844 | |
| ? | 555948 | 10.9% |
| : | 555948 | 10.9% |
| ; | 87757 | 1.7% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 555948 |
Space Separator
| Value | Count | Frequency (%) |
| 87757 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17234388 | |
| Common | 10879873 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 2223792 | |
| . | 1667844 | |
| 1 | 1029241 | |
| 4 | 925235 | |
| ? | 555948 | 5.1% |
| = | 555948 | 5.1% |
| : | 555948 | 5.1% |
| 6 | 489208 | 4.5% |
| 7 | 463799 | 4.3% |
| 5 | 412817 | 3.8% |
| Other values (7) | 2000093 |
Latin
| Value | Count | Frequency (%) |
| i | 2223792 | |
| t | 1667844 | |
| s | 1667844 | |
| n | 1667844 | |
| e | 1667844 | |
| h | 1111896 | 6.5% |
| d | 1111896 | 6.5% |
| m | 1111896 | 6.5% |
| l | 1111896 | 6.5% |
| o | 1111896 | 6.5% |
| Other values (4) | 2779740 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28114261 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2223792 | 7.9% |
| / | 2223792 | 7.9% |
| t | 1667844 | 5.9% |
| s | 1667844 | 5.9% |
| . | 1667844 | 5.9% |
| n | 1667844 | 5.9% |
| e | 1667844 | 5.9% |
| h | 1111896 | 4.0% |
| d | 1111896 | 4.0% |
| m | 1111896 | 4.0% |
| Other values (21) | 11991769 |
Missing 
| Distinct | 1050 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 600397 |
| Missing (%) | 99.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 699 |
|---|---|
| Median length | 49 |
| Mean length | 99.59108159 |
| Min length | 47 |
Unique
| Unique | 1046 ? |
|---|---|
| Unique (%) | 99.2% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=AY922964|https://www.ncbi.nlm.nih.gov/gquery?term=AY922875 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=KC753815|https://www.ncbi.nlm.nih.gov/gquery?term=KC753933|https://www.ncbi.nlm.nih.gov/gquery?term=KC754042|https://www.ncbi.nlm.nih.gov/gquery?term=KC754162|https://www.ncbi.nlm.nih.gov/gquery?term=KC754280 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=KC011508|https://www.ncbi.nlm.nih.gov/gquery?term=KC011594|https://www.ncbi.nlm.nih.gov/gquery?term=KC011682 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=MN707485|https://www.ncbi.nlm.nih.gov/gquery?term=MN707432 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=JQ317640|https://www.ncbi.nlm.nih.gov/gquery?term=JQ317668 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=eu021073 | 2 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj383131 | 2 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kx998919 | 2 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=eu021074 | 2 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=dq178333|https://www.ncbi.nlm.nih.gov/gquery?term=dq178344 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay974630|https://www.ncbi.nlm.nih.gov/gquery?term=ay974676 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kc753815|https://www.ncbi.nlm.nih.gov/gquery?term=kc753933|https://www.ncbi.nlm.nih.gov/gquery?term=kc754042|https://www.ncbi.nlm.nih.gov/gquery?term=kc754162|https://www.ncbi.nlm.nih.gov/gquery?term=kc754280 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kc011508|https://www.ncbi.nlm.nih.gov/gquery?term=kc011594|https://www.ncbi.nlm.nih.gov/gquery?term=kc011682 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mn707485|https://www.ncbi.nlm.nih.gov/gquery?term=mn707432 | 1 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jq317640|https://www.ncbi.nlm.nih.gov/gquery?term=jq317668 | 1 | 0.1% |
| Other values (1040) | 1040 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 8515 | 8.1% |
| / | 6360 | 6.1% |
| w | 6360 | 6.1% |
| n | 6360 | 6.1% |
| t | 6360 | 6.1% |
| h | 4240 | 4.0% |
| r | 4240 | 4.0% |
| e | 4240 | 4.0% |
| i | 4240 | 4.0% |
| m | 4240 | 4.0% |
| Other values (48) | 49814 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65720 | |
| Other Punctuation | 19115 | 18.2% |
| Decimal Number | 12730 | 12.1% |
| Uppercase Letter | 4213 | 4.0% |
| Math Symbol | 3186 | 3.0% |
| Connector Punctuation | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 814 | |
| M | 721 | |
| N | 422 | |
| Y | 404 | |
| A | 392 | |
| T | 258 | 6.1% |
| F | 237 | 5.6% |
| J | 212 | 5.0% |
| C | 171 | 4.1% |
| Q | 146 | 3.5% |
| Other values (12) | 436 |
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 6360 | 9.7% |
| n | 6360 | 9.7% |
| t | 6360 | 9.7% |
| h | 4240 | 6.5% |
| r | 4240 | 6.5% |
| e | 4240 | 6.5% |
| i | 4240 | 6.5% |
| m | 4240 | 6.5% |
| g | 4240 | 6.5% |
| v | 2120 | 3.2% |
| Other values (9) | 19080 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 1517 | |
| 3 | 1452 | |
| 6 | 1407 | |
| 9 | 1389 | |
| 2 | 1352 | |
| 4 | 1216 | |
| 8 | 1213 | |
| 1 | 1128 | |
| 5 | 1094 | |
| 0 | 962 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8515 | |
| / | 6360 | |
| ? | 2120 | 11.1% |
| : | 2120 | 11.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 2120 | |
| | | 1066 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 69933 | |
| Common | 35036 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| w | 6360 | 9.1% |
| n | 6360 | 9.1% |
| t | 6360 | 9.1% |
| h | 4240 | 6.1% |
| r | 4240 | 6.1% |
| e | 4240 | 6.1% |
| i | 4240 | 6.1% |
| m | 4240 | 6.1% |
| g | 4240 | 6.1% |
| v | 2120 | 3.0% |
| Other values (31) | 23293 |
Common
| Value | Count | Frequency (%) |
| . | 8515 | |
| / | 6360 | |
| ? | 2120 | 6.1% |
| : | 2120 | 6.1% |
| = | 2120 | 6.1% |
| 7 | 1517 | 4.3% |
| 3 | 1452 | 4.1% |
| 6 | 1407 | 4.0% |
| 9 | 1389 | 4.0% |
| 2 | 1352 | 3.9% |
| Other values (7) | 6684 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104969 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 8515 | 8.1% |
| / | 6360 | 6.1% |
| w | 6360 | 6.1% |
| n | 6360 | 6.1% |
| t | 6360 | 6.1% |
| h | 4240 | 4.0% |
| r | 4240 | 4.0% |
| e | 4240 | 4.0% |
| i | 4240 | 4.0% |
| m | 4240 | 4.0% |
| Other values (48) | 49814 |
Missing 
| Distinct | 5322 |
|---|---|
| Distinct (%) | 49.3% |
| Missing | 590662 |
| Missing (%) | 98.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 44804 |
|---|---|
| Median length | 2082 |
| Mean length | 214.0076003 |
| Min length | 4 |
Unique
| Unique | 4721 ? |
|---|---|
| Unique (%) | 43.8% |
Sample
| 1st row | From ledger catalogue 577876-577900: "field data recorded from field catalogues" |
|---|---|
| 2nd row | Skin found in rotunda hallway hold-up case, 2017. May need tanning before installation into collection. |
| 3rd row | Lectotype designated by Avila Pires (1968:163). |
| 4th row | Skull removed from alcoholic specimen. |
| 5th row | More than 800 dolphins stranded along a 220 km stretch pof the coast of Peru. See STR18239.; Broccetto, Marilia CNN website 22 IV 2012 |
| Value | Count | Frequency (%) |
| the | 13880 | 3.8% |
| of | 9359 | 2.6% |
| and | 7684 | 2.1% |
| in | 7077 | 1.9% |
| for | 6435 | 1.8% |
| to | 6041 | 1.6% |
| 4896 | 1.3% | |
| on | 4761 | 1.3% |
| was | 4231 | 1.2% |
| from | 3875 | 1.1% |
| Other values (19019) | 298259 |
Most occurring characters
| Value | Count | Frequency (%) |
| 355709 | ||
| e | 205843 | 8.9% |
| a | 147185 | 6.4% |
| t | 125245 | 5.4% |
| o | 122482 | 5.3% |
| n | 120296 | 5.2% |
| i | 111994 | 4.9% |
| s | 111800 | 4.8% |
| r | 110930 | 4.8% |
| l | 77896 | 3.4% |
| Other values (148) | 819548 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1587531 | |
| Space Separator | 355709 | 15.4% |
| Uppercase Letter | 132353 | 5.7% |
| Decimal Number | 122350 | 5.3% |
| Other Punctuation | 87540 | 3.8% |
| Dash Punctuation | 8132 | 0.4% |
| Close Punctuation | 6920 | 0.3% |
| Open Punctuation | 6894 | 0.3% |
| Math Symbol | 680 | < 0.1% |
| Connector Punctuation | 461 | < 0.1% |
| Other values (8) | 358 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 205843 | |
| a | 147185 | 9.3% |
| t | 125245 | 7.9% |
| o | 122482 | 7.7% |
| n | 120296 | 7.6% |
| i | 111994 | 7.1% |
| s | 111800 | 7.0% |
| r | 110930 | 7.0% |
| l | 77896 | 4.9% |
| d | 65194 | 4.1% |
| Other values (53) | 388666 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 13793 | 10.4% |
| M | 11265 | 8.5% |
| N | 10762 | 8.1% |
| T | 10560 | 8.0% |
| C | 8190 | 6.2% |
| F | 7728 | 5.8% |
| I | 7523 | 5.7% |
| A | 7439 | 5.6% |
| B | 6332 | 4.8% |
| R | 5318 | 4.0% |
| Other values (18) | 43443 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 36734 | |
| , | 26137 | |
| : | 6493 | 7.4% |
| " | 5631 | 6.4% |
| ; | 4846 | 5.5% |
| / | 3229 | 3.7% |
| ' | 1865 | 2.1% |
| # | 977 | 1.1% |
| & | 535 | 0.6% |
| ? | 299 | 0.3% |
| Other values (12) | 794 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 20642 | |
| 0 | 20306 | |
| 2 | 20036 | |
| 5 | 10174 | |
| 9 | 10101 | |
| 7 | 9447 | |
| 6 | 8256 | 6.7% |
| 3 | 8246 | 6.7% |
| 4 | 7859 | 6.4% |
| 8 | 7283 | 6.0% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 207 | |
| + | 203 | |
| ~ | 120 | |
| < | 79 | 11.6% |
| > | 62 | 9.1% |
| | | 4 | 0.6% |
| ± | 2 | 0.3% |
| ¬ | 2 | 0.3% |
| − | 1 | 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 29 | |
| ¼ | 7 | 15.9% |
| ¹ | 5 | 11.4% |
| ¾ | 3 | 6.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7459 | |
| – | 656 | 8.1% |
| — | 17 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6315 | |
| ] | 602 | 8.7% |
| } | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6291 | |
| [ | 600 | 8.7% |
| { | 3 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 90 | |
| » | 1 | 1.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 48 | |
| ¥ | 10 | 17.2% |
Format
| Value | Count | Frequency (%) |
| | 3 | |
| | 2 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 | |
| ^ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 355709 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 461 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 83 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 67 |
Other Letter
| Value | Count | Frequency (%) |
| º | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1719816 | |
| Common | 589036 | 25.5% |
| Greek | 76 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 205843 | 12.0% |
| a | 147185 | 8.6% |
| t | 125245 | 7.3% |
| o | 122482 | 7.1% |
| n | 120296 | 7.0% |
| i | 111994 | 6.5% |
| s | 111800 | 6.5% |
| r | 110930 | 6.5% |
| l | 77896 | 4.5% |
| d | 65194 | 3.8% |
| Other values (70) | 520951 |
Common
| Value | Count | Frequency (%) |
| 355709 | ||
| . | 36734 | 6.2% |
| , | 26137 | 4.4% |
| 1 | 20642 | 3.5% |
| 0 | 20306 | 3.4% |
| 2 | 20036 | 3.4% |
| 5 | 10174 | 1.7% |
| 9 | 10101 | 1.7% |
| 7 | 9447 | 1.6% |
| 6 | 8256 | 1.4% |
| Other values (56) | 71494 | 12.1% |
Greek
| Value | Count | Frequency (%) |
| μ | 64 | |
| ο | 2 | 2.6% |
| ή | 1 | 1.3% |
| ϊ | 1 | 1.3% |
| ι | 1 | 1.3% |
| ν | 1 | 1.3% |
| ρ | 1 | 1.3% |
| υ | 1 | 1.3% |
| δ | 1 | 1.3% |
| α | 1 | 1.3% |
| Other values (2) | 2 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2307432 | |
| Punctuation | 858 | < 0.1% |
| None | 637 | < 0.1% |
| Math Operators | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 355709 | ||
| e | 205843 | 8.9% |
| a | 147185 | 6.4% |
| t | 125245 | 5.4% |
| o | 122482 | 5.3% |
| n | 120296 | 5.2% |
| i | 111994 | 4.9% |
| s | 111800 | 4.8% |
| r | 110930 | 4.8% |
| l | 77896 | 3.4% |
| Other values (84) | 818052 |
Punctuation
| Value | Count | Frequency (%) |
| – | 656 | |
| ” | 90 | 10.5% |
| “ | 83 | 9.7% |
| — | 17 | 2.0% |
| • | 4 | 0.5% |
| | 3 | 0.3% |
| … | 2 | 0.2% |
| ″ | 2 | 0.2% |
| ′ | 1 | 0.1% |
None
| Value | Count | Frequency (%) |
| · | 170 | |
| é | 78 | |
| ° | 67 | 10.5% |
| μ | 64 | 10.0% |
| ì | 58 | 9.1% |
| ½ | 29 | 4.6% |
| è | 20 | 3.1% |
| Ö | 12 | 1.9% |
| ä | 10 | 1.6% |
| ü | 10 | 1.6% |
| Other values (44) | 119 |
Math Operators
| Value | Count | Frequency (%) |
| − | 1 |
eventDate
Text
Missing 
| Distinct | 46637 |
|---|---|
| Distinct (%) | 8.1% |
| Missing | 28127 |
| Missing (%) | 4.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 9.727609519 |
| Min length | 4 |
Unique
| Unique | 7673 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | 1989-02-28 |
|---|---|
| 2nd row | 1917-08-08 |
| 3rd row | 1966-05 |
| 4th row | 1894-07-15 |
| 5th row | 1992-11-05 |
| Value | Count | Frequency (%) |
| 1968 | 1160 | 0.2% |
| 1959 | 769 | 0.1% |
| 1965-06 | 704 | 0.1% |
| 1966-06-02 | 682 | 0.1% |
| 1903 | 600 | 0.1% |
| 1905 | 591 | 0.1% |
| 1965 | 543 | 0.1% |
| 1967-08 | 537 | 0.1% |
| 1967-05 | 529 | 0.1% |
| 1968-09-02 | 520 | 0.1% |
| Other values (46627) | 566689 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1092495 | |
| - | 1092362 | |
| 0 | 833561 | |
| 9 | 717766 | |
| 2 | 392002 | 7.0% |
| 6 | 323478 | 5.8% |
| 8 | 309247 | 5.5% |
| 7 | 251593 | 4.5% |
| 3 | 195678 | 3.5% |
| 5 | 191966 | 3.4% |
| Other values (2) | 176924 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4483372 | |
| Dash Punctuation | 1092362 | 19.6% |
| Other Punctuation | 1338 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1092495 | |
| 0 | 833561 | |
| 9 | 717766 | |
| 2 | 392002 | 8.7% |
| 6 | 323478 | 7.2% |
| 8 | 309247 | 6.9% |
| 7 | 251593 | 5.6% |
| 3 | 195678 | 4.4% |
| 5 | 191966 | 4.3% |
| 4 | 175586 | 3.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1092362 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1338 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5577072 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1092495 | |
| - | 1092362 | |
| 0 | 833561 | |
| 9 | 717766 | |
| 2 | 392002 | 7.0% |
| 6 | 323478 | 5.8% |
| 8 | 309247 | 5.5% |
| 7 | 251593 | 4.5% |
| 3 | 195678 | 3.5% |
| 5 | 191966 | 3.4% |
| Other values (2) | 176924 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5577072 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1092495 | |
| - | 1092362 | |
| 0 | 833561 | |
| 9 | 717766 | |
| 2 | 392002 | 7.0% |
| 6 | 323478 | 5.8% |
| 8 | 309247 | 5.5% |
| 7 | 251593 | 4.5% |
| 3 | 195678 | 3.5% |
| 5 | 191966 | 3.4% |
| Other values (2) | 176924 | 3.2% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 46793 |
| Missing (%) | 7.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.724276942 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 59 |
|---|---|
| 2nd row | 220 |
| 3rd row | 151 |
| 4th row | 196 |
| 5th row | 310 |
| Value | Count | Frequency (%) |
| 181 | 3910 | 0.7% |
| 59 | 3214 | 0.6% |
| 243 | 3136 | 0.6% |
| 212 | 3000 | 0.5% |
| 151 | 2957 | 0.5% |
| 213 | 2690 | 0.5% |
| 120 | 2635 | 0.5% |
| 334 | 2476 | 0.4% |
| 193 | 2428 | 0.4% |
| 304 | 2382 | 0.4% |
| Other values (356) | 525830 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 288485 | |
| 2 | 278737 | |
| 3 | 192107 | |
| 5 | 114772 | 7.6% |
| 4 | 114286 | 7.6% |
| 6 | 108886 | 7.2% |
| 0 | 104494 | 6.9% |
| 7 | 103973 | 6.9% |
| 9 | 103679 | 6.9% |
| 8 | 101623 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1511042 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 288485 | |
| 2 | 278737 | |
| 3 | 192107 | |
| 5 | 114772 | 7.6% |
| 4 | 114286 | 7.6% |
| 6 | 108886 | 7.2% |
| 0 | 104494 | 6.9% |
| 7 | 103973 | 6.9% |
| 9 | 103679 | 6.9% |
| 8 | 101623 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1511042 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 288485 | |
| 2 | 278737 | |
| 3 | 192107 | |
| 5 | 114772 | 7.6% |
| 4 | 114286 | 7.6% |
| 6 | 108886 | 7.2% |
| 0 | 104494 | 6.9% |
| 7 | 103973 | 6.9% |
| 9 | 103679 | 6.9% |
| 8 | 101623 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1511042 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 288485 | |
| 2 | 278737 | |
| 3 | 192107 | |
| 5 | 114772 | 7.6% |
| 4 | 114286 | 7.6% |
| 6 | 108886 | 7.2% |
| 0 | 104494 | 6.9% |
| 7 | 103973 | 6.9% |
| 9 | 103679 | 6.9% |
| 8 | 101623 | 6.7% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 46765 |
| Missing (%) | 7.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.724321508 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 59 |
|---|---|
| 2nd row | 220 |
| 3rd row | 151 |
| 4th row | 196 |
| 5th row | 310 |
| Value | Count | Frequency (%) |
| 181 | 3912 | 0.7% |
| 59 | 3215 | 0.6% |
| 243 | 3146 | 0.6% |
| 151 | 3016 | 0.5% |
| 212 | 2960 | 0.5% |
| 213 | 2646 | 0.5% |
| 120 | 2638 | 0.5% |
| 334 | 2464 | 0.4% |
| 304 | 2406 | 0.4% |
| 222 | 2369 | 0.4% |
| Other values (356) | 525914 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 288287 | |
| 2 | 278817 | |
| 3 | 192047 | |
| 5 | 114832 | 7.6% |
| 4 | 114656 | 7.6% |
| 6 | 108777 | 7.2% |
| 0 | 104587 | 6.9% |
| 7 | 103968 | 6.9% |
| 9 | 103568 | 6.9% |
| 8 | 101604 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1511143 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 288287 | |
| 2 | 278817 | |
| 3 | 192047 | |
| 5 | 114832 | 7.6% |
| 4 | 114656 | 7.6% |
| 6 | 108777 | 7.2% |
| 0 | 104587 | 6.9% |
| 7 | 103968 | 6.9% |
| 9 | 103568 | 6.9% |
| 8 | 101604 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1511143 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 288287 | |
| 2 | 278817 | |
| 3 | 192047 | |
| 5 | 114832 | 7.6% |
| 4 | 114656 | 7.6% |
| 6 | 108777 | 7.2% |
| 0 | 104587 | 6.9% |
| 7 | 103968 | 6.9% |
| 9 | 103568 | 6.9% |
| 8 | 101604 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1511143 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 288287 | |
| 2 | 278817 | |
| 3 | 192047 | |
| 5 | 114832 | 7.6% |
| 4 | 114656 | 7.6% |
| 6 | 108777 | 7.2% |
| 0 | 104587 | 6.9% |
| 7 | 103968 | 6.9% |
| 9 | 103568 | 6.9% |
| 8 | 101604 | 6.7% |
year
Text
Missing 
| Distinct | 350 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 28127 |
| Missing (%) | 4.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 74 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1989 |
|---|---|
| 2nd row | 1917 |
| 3rd row | 1966 |
| 4th row | 1894 |
| 5th row | 1992 |
| Value | Count | Frequency (%) |
| 1967 | 30814 | 5.4% |
| 1968 | 27037 | 4.7% |
| 1966 | 22575 | 3.9% |
| 1969 | 15259 | 2.7% |
| 1965 | 12690 | 2.2% |
| 1964 | 12541 | 2.2% |
| 1962 | 11211 | 2.0% |
| 1970 | 10527 | 1.8% |
| 1916 | 9955 | 1.7% |
| 1963 | 9798 | 1.7% |
| Other values (340) | 410917 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 670022 | |
| 9 | 621720 | |
| 6 | 214950 | 9.4% |
| 8 | 199846 | 8.7% |
| 7 | 134632 | 5.9% |
| 0 | 133037 | 5.8% |
| 5 | 87362 | 3.8% |
| 2 | 86888 | 3.8% |
| 4 | 76111 | 3.3% |
| 3 | 68728 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2293296 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 670022 | |
| 9 | 621720 | |
| 6 | 214950 | 9.4% |
| 8 | 199846 | 8.7% |
| 7 | 134632 | 5.9% |
| 0 | 133037 | 5.8% |
| 5 | 87362 | 3.8% |
| 2 | 86888 | 3.8% |
| 4 | 76111 | 3.3% |
| 3 | 68728 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2293296 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 670022 | |
| 9 | 621720 | |
| 6 | 214950 | 9.4% |
| 8 | 199846 | 8.7% |
| 7 | 134632 | 5.9% |
| 0 | 133037 | 5.8% |
| 5 | 87362 | 3.8% |
| 2 | 86888 | 3.8% |
| 4 | 76111 | 3.3% |
| 3 | 68728 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2293296 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 670022 | |
| 9 | 621720 | |
| 6 | 214950 | 9.4% |
| 8 | 199846 | 8.7% |
| 7 | 134632 | 5.9% |
| 0 | 133037 | 5.8% |
| 5 | 87362 | 3.8% |
| 2 | 86888 | 3.8% |
| 4 | 76111 | 3.3% |
| 3 | 68728 | 3.0% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 44866 |
| Missing (%) | 7.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.192750433 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 8 |
| 3rd row | 5 |
| 4th row | 7 |
| 5th row | 11 |
| Value | Count | Frequency (%) |
| 7 | 63622 | |
| 8 | 55632 | |
| 6 | 55508 | |
| 3 | 50988 | |
| 5 | 50119 | |
| 4 | 46824 | |
| 9 | 43994 | |
| 2 | 43078 | |
| 10 | 40461 | |
| 1 | 39538 | |
| Other values (2) | 66821 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 182088 | |
| 2 | 74631 | |
| 7 | 63622 | 9.6% |
| 8 | 55632 | 8.4% |
| 6 | 55508 | 8.4% |
| 3 | 50988 | 7.7% |
| 5 | 50119 | 7.5% |
| 4 | 46824 | 7.1% |
| 9 | 43994 | 6.6% |
| 0 | 40461 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 663867 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 182088 | |
| 2 | 74631 | |
| 7 | 63622 | 9.6% |
| 8 | 55632 | 8.4% |
| 6 | 55508 | 8.4% |
| 3 | 50988 | 7.7% |
| 5 | 50119 | 7.5% |
| 4 | 46824 | 7.1% |
| 9 | 43994 | 6.6% |
| 0 | 40461 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 663867 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 182088 | |
| 2 | 74631 | |
| 7 | 63622 | 9.6% |
| 8 | 55632 | 8.4% |
| 6 | 55508 | 8.4% |
| 3 | 50988 | 7.7% |
| 5 | 50119 | 7.5% |
| 4 | 46824 | 7.1% |
| 9 | 43994 | 6.6% |
| 0 | 40461 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 663867 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 182088 | |
| 2 | 74631 | |
| 7 | 63622 | 9.6% |
| 8 | 55632 | 8.4% |
| 6 | 55508 | 8.4% |
| 3 | 50988 | 7.7% |
| 5 | 50119 | 7.5% |
| 4 | 46824 | 7.1% |
| 9 | 43994 | 6.6% |
| 0 | 40461 | 6.1% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 67482 |
| Missing (%) | 11.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.708157215 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 28 |
|---|---|
| 2nd row | 8 |
| 3rd row | 15 |
| 4th row | 5 |
| 5th row | 18 |
| Value | Count | Frequency (%) |
| 10 | 19188 | 3.6% |
| 20 | 18614 | 3.5% |
| 22 | 18464 | 3.5% |
| 15 | 18400 | 3.4% |
| 18 | 18199 | 3.4% |
| 14 | 18001 | 3.4% |
| 5 | 17933 | 3.4% |
| 16 | 17919 | 3.4% |
| 27 | 17835 | 3.3% |
| 21 | 17818 | 3.3% |
| Other values (21) | 351598 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 238401 | |
| 2 | 229686 | |
| 3 | 75593 | 8.3% |
| 5 | 53889 | 5.9% |
| 0 | 53247 | 5.8% |
| 8 | 53132 | 5.8% |
| 7 | 52819 | 5.8% |
| 6 | 52526 | 5.8% |
| 4 | 52154 | 5.7% |
| 9 | 50656 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 912103 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 238401 | |
| 2 | 229686 | |
| 3 | 75593 | 8.3% |
| 5 | 53889 | 5.9% |
| 0 | 53247 | 5.8% |
| 8 | 53132 | 5.8% |
| 7 | 52819 | 5.8% |
| 6 | 52526 | 5.8% |
| 4 | 52154 | 5.7% |
| 9 | 50656 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 912103 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 238401 | |
| 2 | 229686 | |
| 3 | 75593 | 8.3% |
| 5 | 53889 | 5.9% |
| 0 | 53247 | 5.8% |
| 8 | 53132 | 5.8% |
| 7 | 52819 | 5.8% |
| 6 | 52526 | 5.8% |
| 4 | 52154 | 5.7% |
| 9 | 50656 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 912103 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 238401 | |
| 2 | 229686 | |
| 3 | 75593 | 8.3% |
| 5 | 53889 | 5.9% |
| 0 | 53247 | 5.8% |
| 8 | 53132 | 5.8% |
| 7 | 52819 | 5.8% |
| 6 | 52526 | 5.8% |
| 4 | 52154 | 5.7% |
| 9 | 50656 | 5.6% |
Missing 
| Distinct | 45124 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 36490 |
| Missing (%) | 6.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 82 |
|---|---|
| Median length | 11 |
| Mean length | 10.73425953 |
| Min length | 3 |
Unique
| Unique | 7925 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 28 Feb 1989 |
|---|---|
| 2nd row | 8 Aug 1917 |
| 3rd row | -- May 1966 |
| 4th row | 15 Jul 1894 |
| 5th row | 5 Nov 1992 |
| Value | Count | Frequency (%) |
| 119289 | 7.0% | |
| jul | 59029 | 3.5% |
| aug | 52663 | 3.1% |
| jun | 52253 | 3.1% |
| mar | 49098 | 2.9% |
| may | 47959 | 2.8% |
| apr | 45015 | 2.6% |
| sep | 41961 | 2.5% |
| feb | 40432 | 2.4% |
| oct | 39123 | 2.3% |
| Other values (873) | 1153619 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1135480 | ||
| 1 | 869039 | |
| 9 | 644744 | 10.6% |
| 2 | 290400 | 4.8% |
| - | 284559 | 4.7% |
| 6 | 256804 | 4.2% |
| 8 | 242113 | 4.0% |
| 7 | 176263 | 2.9% |
| u | 165038 | 2.7% |
| 0 | 163304 | 2.7% |
| Other values (65) | 1836694 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3034227 | |
| Space Separator | 1135480 | 18.7% |
| Lowercase Letter | 1072102 | 17.7% |
| Uppercase Letter | 534667 | 8.8% |
| Dash Punctuation | 284559 | 4.7% |
| Other Punctuation | 3387 | 0.1% |
| Close Punctuation | 7 | < 0.1% |
| Open Punctuation | 6 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 165038 | |
| a | 133875 | |
| e | 114602 | |
| r | 97161 | |
| n | 90730 | |
| p | 87414 | |
| c | 68763 | |
| l | 60684 | 5.7% |
| g | 53357 | 5.0% |
| y | 47929 | 4.5% |
| Other values (14) | 152549 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 147559 | |
| A | 97950 | |
| M | 97151 | |
| S | 43634 | 8.2% |
| F | 41188 | 7.7% |
| O | 39198 | 7.3% |
| N | 33829 | 6.3% |
| D | 30011 | 5.6% |
| W | 1456 | 0.3% |
| E | 615 | 0.1% |
| Other values (13) | 2076 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 869039 | |
| 9 | 644744 | |
| 2 | 290400 | 9.6% |
| 6 | 256804 | 8.5% |
| 8 | 242113 | 8.0% |
| 7 | 176263 | 5.8% |
| 0 | 163304 | 5.4% |
| 3 | 136893 | 4.5% |
| 5 | 134478 | 4.4% |
| 4 | 120189 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| * | 2267 | |
| , | 926 | |
| ? | 105 | 3.1% |
| : | 53 | 1.6% |
| / | 21 | 0.6% |
| . | 6 | 0.2% |
| ' | 5 | 0.1% |
| & | 2 | 0.1% |
| ; | 2 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 | |
| < | 1 | |
| ~ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6 | |
| ] | 1 | 14.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 | |
| [ | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 1135480 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 284559 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4457669 | |
| Latin | 1606769 | 26.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 165038 | 10.3% |
| J | 147559 | 9.2% |
| a | 133875 | 8.3% |
| e | 114602 | 7.1% |
| A | 97950 | 6.1% |
| r | 97161 | 6.0% |
| M | 97151 | 6.0% |
| n | 90730 | 5.6% |
| p | 87414 | 5.4% |
| c | 68763 | 4.3% |
| Other values (37) | 506526 |
Common
| Value | Count | Frequency (%) |
| 1135480 | ||
| 1 | 869039 | |
| 9 | 644744 | |
| 2 | 290400 | 6.5% |
| - | 284559 | 6.4% |
| 6 | 256804 | 5.8% |
| 8 | 242113 | 5.4% |
| 7 | 176263 | 4.0% |
| 0 | 163304 | 3.7% |
| 3 | 136893 | 3.1% |
| Other values (18) | 258070 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6064438 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1135480 | ||
| 1 | 869039 | |
| 9 | 644744 | 10.6% |
| 2 | 290400 | 4.8% |
| - | 284559 | 4.7% |
| 6 | 256804 | 4.2% |
| 8 | 242113 | 4.0% |
| 7 | 176263 | 2.9% |
| u | 165038 | 2.7% |
| 0 | 163304 | 2.7% |
| Other values (65) | 1836694 |
habitat
Text
Missing 
| Distinct | 7512 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 468915 |
| Missing (%) | 78.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 1014 |
|---|---|
| Median length | 694 |
| Mean length | 27.3692808 |
| Min length | 1 |
Unique
| Unique | 4415 ? |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | Ecological remarks by collector(s): yes |
|---|---|
| 2nd row | Premontane very humid forest |
| 3rd row | Ecological remarks by collector(s): no |
| 4th row | Ecological remarks by collector(s): yes |
| 5th row | Culvert |
| Value | Count | Frequency (%) |
| by | 49297 | 9.4% |
| ecological | 48727 | 9.3% |
| remarks | 48718 | 9.3% |
| collector(s | 48716 | 9.3% |
| yes | 41564 | 8.0% |
| forest | 32139 | 6.2% |
| tropical | 15058 | 2.9% |
| humid | 14768 | 2.8% |
| no | 7275 | 1.4% |
| in | 6943 | 1.3% |
| Other values (3497) | 208498 |
Most occurring characters
| Value | Count | Frequency (%) |
| 389167 | 10.7% | |
| o | 316538 | 8.7% |
| e | 293307 | 8.1% |
| r | 281112 | 7.7% |
| l | 253946 | 7.0% |
| s | 244547 | 6.7% |
| c | 240040 | 6.6% |
| a | 233816 | 6.4% |
| i | 137021 | 3.8% |
| t | 136017 | 3.7% |
| Other values (76) | 1101904 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2931962 | |
| Space Separator | 389167 | 10.7% |
| Uppercase Letter | 134371 | 3.7% |
| Other Punctuation | 62424 | 1.7% |
| Open Punctuation | 49723 | 1.4% |
| Close Punctuation | 49712 | 1.4% |
| Decimal Number | 6872 | 0.2% |
| Dash Punctuation | 3142 | 0.1% |
| Math Symbol | 40 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 316538 | |
| e | 293307 | |
| r | 281112 | |
| l | 253946 | 8.7% |
| s | 244547 | 8.3% |
| c | 240040 | 8.2% |
| a | 233816 | 8.0% |
| i | 137021 | 4.7% |
| t | 136017 | 4.6% |
| y | 117063 | 4.0% |
| Other values (16) | 678555 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 49837 | |
| T | 18330 | 13.6% |
| S | 10045 | 7.5% |
| R | 7675 | 5.7% |
| P | 6589 | 4.9% |
| G | 6219 | 4.6% |
| C | 4362 | 3.2% |
| M | 4095 | 3.0% |
| A | 3747 | 2.8% |
| B | 3506 | 2.6% |
| Other values (16) | 19966 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 48943 | |
| , | 7291 | 11.7% |
| . | 4022 | 6.4% |
| ; | 832 | 1.3% |
| " | 403 | 0.6% |
| & | 381 | 0.6% |
| / | 229 | 0.4% |
| ? | 145 | 0.2% |
| ' | 102 | 0.2% |
| # | 62 | 0.1% |
| Other values (3) | 14 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2599 | |
| 1 | 1142 | |
| 2 | 872 | 12.7% |
| 3 | 636 | 9.3% |
| 5 | 469 | 6.8% |
| 4 | 334 | 4.9% |
| 8 | 251 | 3.7% |
| 6 | 220 | 3.2% |
| 7 | 185 | 2.7% |
| 9 | 164 | 2.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 49366 | |
| ] | 345 | 0.7% |
| } | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 33 | |
| + | 5 | 12.5% |
| ~ | 2 | 5.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 49378 | |
| [ | 345 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 389167 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3142 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3066333 | |
| Common | 561082 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 316538 | 10.3% |
| e | 293307 | 9.6% |
| r | 281112 | 9.2% |
| l | 253946 | 8.3% |
| s | 244547 | 8.0% |
| c | 240040 | 7.8% |
| a | 233816 | 7.6% |
| i | 137021 | 4.5% |
| t | 136017 | 4.4% |
| y | 117063 | 3.8% |
| Other values (42) | 812926 |
Common
| Value | Count | Frequency (%) |
| 389167 | ||
| ( | 49378 | 8.8% |
| ) | 49366 | 8.8% |
| : | 48943 | 8.7% |
| , | 7291 | 1.3% |
| . | 4022 | 0.7% |
| - | 3142 | 0.6% |
| 0 | 2599 | 0.5% |
| 1 | 1142 | 0.2% |
| 2 | 872 | 0.2% |
| Other values (24) | 5160 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3627413 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 389167 | 10.7% | |
| o | 316538 | 8.7% |
| e | 293307 | 8.1% |
| r | 281112 | 7.7% |
| l | 253946 | 7.0% |
| s | 244547 | 6.7% |
| c | 240040 | 6.6% |
| a | 233816 | 6.4% |
| i | 137021 | 3.8% |
| t | 136017 | 3.7% |
| Other values (75) | 1101902 |
Punctuation
| Value | Count | Frequency (%) |
| › | 2 |
higherGeography
Text
| Distinct | 8925 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 440 |
| Missing (%) | 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 146 |
|---|---|
| Median length | 124 |
| Mean length | 39.09340095 |
| Min length | 4 |
Unique
| Unique | 3023 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | North America, Panama, Bocas Del Toro |
|---|---|
| 2nd row | North America, United States, Utah |
| 3rd row | South America, Venezuela, Bolivar |
| 4th row | North America, Mexico, Oaxaca |
| 5th row | North America, North Atlantic Ocean, United States, North Carolina, Carteret |
| Value | Count | Frequency (%) |
| america | 390243 | 12.4% |
| north | 378352 | 12.1% |
| united | 229925 | 7.3% |
| states | 225212 | 7.2% |
| africa | 111667 | 3.6% |
| south | 90792 | 2.9% |
| county | 80759 | 2.6% |
| asia | 66157 | 2.1% |
| ocean | 58408 | 1.9% |
| mexico | 50692 | 1.6% |
| Other values (5566) | 1452640 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2533836 | 10.8% | |
| a | 2342309 | 10.0% |
| i | 1683292 | 7.2% |
| t | 1628350 | 6.9% |
| e | 1586909 | 6.8% |
| r | 1444280 | 6.1% |
| , | 1372561 | 5.8% |
| o | 1263879 | 5.4% |
| n | 1236327 | 5.3% |
| c | 879180 | 3.7% |
| Other values (81) | 7524641 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16409922 | |
| Uppercase Letter | 3147373 | 13.4% |
| Space Separator | 2533836 | 10.8% |
| Other Punctuation | 1384733 | 5.9% |
| Dash Punctuation | 19470 | 0.1% |
| Open Punctuation | 106 | < 0.1% |
| Close Punctuation | 106 | < 0.1% |
| Decimal Number | 12 | < 0.1% |
| Modifier Letter | 5 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2342309 | |
| i | 1683292 | |
| t | 1628350 | |
| e | 1586909 | |
| r | 1444280 | |
| o | 1263879 | |
| n | 1236327 | |
| c | 879180 | 5.4% |
| s | 644321 | 3.9% |
| h | 637727 | 3.9% |
| Other values (35) | 3063348 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 694612 | |
| N | 456008 | |
| S | 407690 | |
| U | 266626 | 8.5% |
| C | 259605 | 8.2% |
| M | 141875 | 4.5% |
| P | 124642 | 4.0% |
| O | 99864 | 3.2% |
| B | 97350 | 3.1% |
| T | 70558 | 2.2% |
| Other values (17) | 528543 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1372561 | |
| ' | 7365 | 0.5% |
| . | 3951 | 0.3% |
| ? | 630 | < 0.1% |
| * | 122 | < 0.1% |
| / | 103 | < 0.1% |
| : | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 4 | |
| 2 | 4 | |
| 1 | 2 | |
| 0 | 1 | 8.3% |
| 8 | 1 | 8.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19466 | |
| – | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2533836 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 106 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 106 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 5 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19557295 | |
| Common | 3938269 | 16.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2342309 | 12.0% |
| i | 1683292 | 8.6% |
| t | 1628350 | 8.3% |
| e | 1586909 | 8.1% |
| r | 1444280 | 7.4% |
| o | 1263879 | 6.5% |
| n | 1236327 | 6.3% |
| c | 879180 | 4.5% |
| A | 694612 | 3.6% |
| s | 644321 | 3.3% |
| Other values (62) | 6153836 |
Common
| Value | Count | Frequency (%) |
| 2533836 | ||
| , | 1372561 | |
| - | 19466 | 0.5% |
| ' | 7365 | 0.2% |
| . | 3951 | 0.1% |
| ? | 630 | < 0.1% |
| * | 122 | < 0.1% |
| ( | 106 | < 0.1% |
| ) | 106 | < 0.1% |
| / | 103 | < 0.1% |
| Other values (9) | 23 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23494048 | |
| None | 1504 | < 0.1% |
| Modifier Letters | 5 | < 0.1% |
| Punctuation | 4 | < 0.1% |
| Latin Ext Additional | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2533836 | 10.8% | |
| a | 2342309 | 10.0% |
| i | 1683292 | 7.2% |
| t | 1628350 | 6.9% |
| e | 1586909 | 6.8% |
| r | 1444280 | 6.1% |
| , | 1372561 | 5.8% |
| o | 1263879 | 5.4% |
| n | 1236327 | 5.3% |
| c | 879180 | 3.7% |
| Other values (59) | 7523125 |
None
| Value | Count | Frequency (%) |
| é | 564 | |
| ó | 346 | |
| ä | 178 | 11.8% |
| í | 176 | 11.7% |
| ê | 104 | 6.9% |
| è | 57 | 3.8% |
| ô | 53 | 3.5% |
| ū | 5 | 0.3% |
| ā | 4 | 0.3% |
| Đ | 3 | 0.2% |
| Other values (9) | 14 | 0.9% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 5 |
Punctuation
| Value | Count | Frequency (%) |
| – | 4 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ắ | 3 |
continent
Text
| Distinct | 100 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 490 |
| Missing (%) | 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 13 |
| Mean length | 12.45328399 |
| Min length | 4 |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | North America |
|---|---|
| 2nd row | North America |
| 3rd row | South America |
| 4th row | North America |
| 5th row | North America, North Atlantic Ocean |
| Value | Count | Frequency (%) |
| america | 390237 | |
| north | 367501 | |
| africa | 99818 | 8.6% |
| south | 74300 | 6.4% |
| asia | 66157 | 5.7% |
| ocean | 58129 | 5.0% |
| atlantic | 30063 | 2.6% |
| pacific | 21536 | 1.9% |
| europe | 14885 | 1.3% |
| unknown | 13134 | 1.1% |
| Other values (9) | 26436 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 882026 | |
| a | 694459 | |
| i | 652876 | |
| c | 637049 | |
| A | 593130 | 7.9% |
| 561235 | 7.5% | |
| t | 524859 | 7.0% |
| o | 485683 | 6.5% |
| e | 466196 | 6.2% |
| h | 444531 | 5.9% |
| Other values (22) | 1541894 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5712548 | |
| Uppercase Letter | 1162018 | 15.5% |
| Space Separator | 561235 | 7.5% |
| Other Punctuation | 48137 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 882026 | |
| a | 694459 | |
| i | 652876 | |
| c | 637049 | |
| t | 524859 | |
| o | 485683 | |
| e | 466196 | |
| h | 444531 | |
| m | 390237 | |
| n | 137405 | 2.4% |
| Other values (9) | 397227 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 593130 | |
| N | 367501 | |
| S | 77030 | 6.6% |
| O | 58344 | 5.0% |
| P | 21536 | 1.9% |
| E | 14885 | 1.3% |
| L | 13133 | 1.1% |
| U | 13133 | 1.1% |
| I | 3326 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 47959 | |
| ? | 142 | 0.3% |
| / | 36 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 561235 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6874566 | |
| Common | 609372 | 8.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 882026 | |
| a | 694459 | |
| i | 652876 | |
| c | 637049 | |
| A | 593130 | |
| t | 524859 | |
| o | 485683 | |
| e | 466196 | |
| h | 444531 | |
| m | 390237 | 5.7% |
| Other values (18) | 1103520 |
Common
| Value | Count | Frequency (%) |
| 561235 | ||
| , | 47959 | 7.9% |
| ? | 142 | < 0.1% |
| / | 36 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7483938 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 882026 | |
| a | 694459 | |
| i | 652876 | |
| c | 637049 | |
| A | 593130 | 7.9% |
| 561235 | 7.5% | |
| t | 524859 | 7.0% |
| o | 485683 | 6.5% |
| e | 466196 | 6.2% |
| h | 444531 | 5.9% |
| Other values (22) | 1541894 |
waterBody
Text
Missing 
| Distinct | 1298 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 539858 |
| Missing (%) | 89.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 79 |
|---|---|
| Median length | 75 |
| Mean length | 24.02534379 |
| Min length | 6 |
Unique
| Unique | 776 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | North Atlantic Ocean |
|---|---|
| 2nd row | North Pacific Ocean, Bering Sea |
| 3rd row | North Pacific Ocean |
| 4th row | North Atlantic Ocean, Gulf Of Mexico |
| 5th row | North Pacific Ocean |
| Value | Count | Frequency (%) |
| ocean | 58130 | |
| north | 49957 | |
| atlantic | 30063 | |
| pacific | 21536 | 9.4% |
| sea | 8710 | 3.8% |
| of | 8285 | 3.6% |
| gulf | 7277 | 3.2% |
| mexico | 6087 | 2.7% |
| south | 3736 | 1.6% |
| indian | 3443 | 1.5% |
| Other values (1047) | 32100 |
Most occurring characters
| Value | Count | Frequency (%) |
| 167731 | ||
| a | 149650 | 10.1% |
| c | 142458 | 9.6% |
| t | 125319 | 8.5% |
| n | 116971 | 7.9% |
| i | 97425 | 6.6% |
| e | 90274 | 6.1% |
| o | 70318 | 4.8% |
| O | 66128 | 4.5% |
| r | 64946 | 4.4% |
| Other values (51) | 388573 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1060630 | |
| Uppercase Letter | 228943 | 15.5% |
| Space Separator | 167731 | 11.3% |
| Other Punctuation | 22340 | 1.5% |
| Dash Punctuation | 147 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 149650 | |
| c | 142458 | |
| t | 125319 | |
| n | 116971 | |
| i | 97425 | |
| e | 90274 | |
| o | 70318 | |
| r | 64946 | |
| h | 61407 | |
| l | 46029 | 4.3% |
| Other values (17) | 95833 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 66128 | |
| N | 50247 | |
| A | 32498 | |
| P | 22062 | 9.6% |
| S | 16927 | 7.4% |
| G | 7662 | 3.3% |
| C | 7479 | 3.3% |
| M | 7332 | 3.2% |
| B | 7248 | 3.2% |
| I | 3893 | 1.7% |
| Other values (15) | 7467 | 3.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 22196 | |
| ? | 67 | 0.3% |
| . | 43 | 0.2% |
| ' | 33 | 0.1% |
| * | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 167731 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 147 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1289573 | |
| Common | 190220 | 12.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 149650 | |
| c | 142458 | |
| t | 125319 | |
| n | 116971 | 9.1% |
| i | 97425 | 7.6% |
| e | 90274 | 7.0% |
| o | 70318 | 5.5% |
| O | 66128 | 5.1% |
| r | 64946 | 5.0% |
| h | 61407 | 4.8% |
| Other values (42) | 304677 |
Common
| Value | Count | Frequency (%) |
| 167731 | ||
| , | 22196 | 11.7% |
| - | 147 | 0.1% |
| ? | 67 | < 0.1% |
| . | 43 | < 0.1% |
| ' | 33 | < 0.1% |
| * | 1 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1479792 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 167731 | ||
| a | 149650 | 10.1% |
| c | 142458 | 9.6% |
| t | 125319 | 8.5% |
| n | 116971 | 7.9% |
| i | 97425 | 6.6% |
| e | 90274 | 6.1% |
| o | 70318 | 4.8% |
| O | 66128 | 4.5% |
| r | 64946 | 4.4% |
| Other values (50) | 388572 |
None
| Value | Count | Frequency (%) |
| ö | 1 |
islandGroup
Text
Missing 
| Distinct | 68 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 596682 |
| Missing (%) | 99.2% |
| Memory size | 4.6 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 24 |
| Mean length | 13.28538478 |
| Min length | 8 |
Unique
| Unique | 18 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Pribilof Islands |
|---|---|
| 2nd row | Pribilof Islands |
| 3rd row | Ryukyu Islands |
| 4th row | Pribilof Islands |
| 5th row | Batan Islands |
| Value | Count | Frequency (%) |
| islands | 3374 | |
| pribilof | 1808 | |
| moluccas | 1194 | 14.4% |
| ryukyu | 497 | 6.0% |
| babuyan | 176 | 2.1% |
| channel | 159 | 1.9% |
| batan | 120 | 1.5% |
| nicobar | 108 | 1.3% |
| bismarck | 94 | 1.1% |
| yap | 83 | 1.0% |
| Other values (66) | 653 | 7.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 8103 | |
| l | 6718 | 10.6% |
| a | 6381 | 10.1% |
| n | 4444 | 7.0% |
| i | 4222 | 6.7% |
| d | 3521 | 5.6% |
| 3497 | 5.5% | |
| I | 3376 | 5.3% |
| o | 3353 | 5.3% |
| c | 2688 | 4.2% |
| Other values (36) | 17055 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 51599 | |
| Uppercase Letter | 8262 | 13.0% |
| Space Separator | 3497 | 5.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 8103 | |
| l | 6718 | |
| a | 6381 | |
| n | 4444 | |
| i | 4222 | |
| d | 3521 | |
| o | 3353 | |
| c | 2688 | 5.2% |
| u | 2566 | 5.0% |
| r | 2242 | 4.3% |
| Other values (14) | 7361 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 3376 | |
| P | 1814 | |
| M | 1235 | 14.9% |
| R | 497 | 6.0% |
| B | 412 | 5.0% |
| C | 183 | 2.2% |
| S | 153 | 1.9% |
| A | 151 | 1.8% |
| N | 122 | 1.5% |
| Y | 83 | 1.0% |
| Other values (11) | 236 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| 3497 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59861 | |
| Common | 3497 | 5.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 8103 | |
| l | 6718 | |
| a | 6381 | |
| n | 4444 | 7.4% |
| i | 4222 | 7.1% |
| d | 3521 | 5.9% |
| I | 3376 | 5.6% |
| o | 3353 | 5.6% |
| c | 2688 | 4.5% |
| u | 2566 | 4.3% |
| Other values (35) | 14489 |
Common
| Value | Count | Frequency (%) |
| 3497 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63358 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 8103 | |
| l | 6718 | 10.6% |
| a | 6381 | 10.1% |
| n | 4444 | 7.0% |
| i | 4222 | 6.7% |
| d | 3521 | 5.6% |
| 3497 | 5.5% | |
| I | 3376 | 5.3% |
| o | 3353 | 5.3% |
| c | 2688 | 4.2% |
| Other values (36) | 17055 |
island
Text
Missing 
| Distinct | 345 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 564842 |
| Missing (%) | 93.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 21 |
| Mean length | 8.146903767 |
| Min length | 1 |
Unique
| Unique | 103 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | St. Paul Island |
|---|---|
| 2nd row | St. Paul Island |
| 3rd row | Trinidad |
| 4th row | Borneo |
| 5th row | Culion Island |
| Value | Count | Frequency (%) |
| island | 7184 | |
| borneo | 5932 | 12.2% |
| sumatra | 3675 | 7.5% |
| luzon | 3124 | 6.4% |
| java | 3005 | 6.2% |
| celebes | 2678 | 5.5% |
| trinidad | 2605 | 5.4% |
| st | 1818 | 3.7% |
| paul | 1799 | 3.7% |
| honshu | 1290 | 2.6% |
| Other values (366) | 15576 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 39564 | |
| n | 28846 | 9.7% |
| o | 23778 | 8.0% |
| e | 21049 | 7.1% |
| r | 16512 | 5.5% |
| d | 15796 | 5.3% |
| l | 15656 | 5.2% |
| s | 14538 | 4.9% |
| u | 14063 | 4.7% |
| 12077 | 4.0% | |
| Other values (47) | 96371 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 235808 | |
| Uppercase Letter | 48529 | 16.3% |
| Space Separator | 12077 | 4.0% |
| Other Punctuation | 1830 | 0.6% |
| Dash Punctuation | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 39564 | |
| n | 28846 | |
| o | 23778 | |
| e | 21049 | |
| r | 16512 | |
| d | 15796 | 6.7% |
| l | 15656 | 6.6% |
| s | 14538 | 6.2% |
| u | 14063 | 6.0% |
| i | 11254 | 4.8% |
| Other values (16) | 34752 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 7839 | |
| B | 7137 | |
| S | 7123 | |
| L | 4203 | |
| C | 3825 | |
| P | 3689 | |
| T | 3258 | |
| J | 3022 | 6.2% |
| N | 2160 | 4.5% |
| H | 1664 | 3.4% |
| Other values (14) | 4609 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1817 | |
| ' | 9 | 0.5% |
| ? | 2 | 0.1% |
| * | 1 | 0.1% |
| , | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 12077 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 284337 | |
| Common | 13913 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 39564 | |
| n | 28846 | 10.1% |
| o | 23778 | 8.4% |
| e | 21049 | 7.4% |
| r | 16512 | 5.8% |
| d | 15796 | 5.6% |
| l | 15656 | 5.5% |
| s | 14538 | 5.1% |
| u | 14063 | 4.9% |
| i | 11254 | 4.0% |
| Other values (40) | 83281 |
Common
| Value | Count | Frequency (%) |
| 12077 | ||
| . | 1817 | 13.1% |
| ' | 9 | 0.1% |
| - | 6 | < 0.1% |
| ? | 2 | < 0.1% |
| * | 1 | < 0.1% |
| , | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 298250 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 39564 | |
| n | 28846 | 9.7% |
| o | 23778 | 8.0% |
| e | 21049 | 7.1% |
| r | 16512 | 5.5% |
| d | 15796 | 5.3% |
| l | 15656 | 5.2% |
| s | 14538 | 4.9% |
| u | 14063 | 4.7% |
| 12077 | 4.0% | |
| Other values (47) | 96371 |
country
Text
Missing 
| Distinct | 322 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 6532 |
| Missing (%) | 1.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 33 |
| Mean length | 10.00060512 |
| Min length | 1 |
Unique
| Unique | 44 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Panama |
|---|---|
| 2nd row | United States |
| 3rd row | Venezuela |
| 4th row | Mexico |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 229925 | |
| states | 225212 | |
| mexico | 34730 | 3.9% |
| panama | 25482 | 2.9% |
| venezuela | 24981 | 2.8% |
| canada | 19301 | 2.2% |
| colombia | 16624 | 1.9% |
| indonesia | 14922 | 1.7% |
| south | 12721 | 1.4% |
| brazil | 12246 | 1.4% |
| Other values (303) | 274156 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 752064 | |
| a | 718314 | |
| e | 679639 | |
| n | 502391 | 8.4% |
| i | 486974 | 8.2% |
| d | 307154 | 5.2% |
| 295381 | 5.0% | |
| s | 291189 | 4.9% |
| S | 249895 | 4.2% |
| U | 243908 | 4.1% |
| Other values (52) | 1422641 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4760116 | |
| Uppercase Letter | 886812 | 14.9% |
| Space Separator | 295381 | 5.0% |
| Other Punctuation | 7042 | 0.1% |
| Open Punctuation | 91 | < 0.1% |
| Close Punctuation | 91 | < 0.1% |
| Dash Punctuation | 17 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 752064 | |
| a | 718314 | |
| e | 679639 | |
| n | 502391 | |
| i | 486974 | |
| d | 307154 | |
| s | 291189 | 6.1% |
| o | 210105 | 4.4% |
| l | 114970 | 2.4% |
| r | 100369 | 2.1% |
| Other values (17) | 596947 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 249895 | |
| U | 243908 | |
| M | 59605 | 6.7% |
| C | 52665 | 5.9% |
| P | 45129 | 5.1% |
| B | 31355 | 3.5% |
| I | 29005 | 3.3% |
| V | 28088 | 3.2% |
| A | 20638 | 2.3% |
| G | 19590 | 2.2% |
| Other values (15) | 106934 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 3154 | |
| . | 1894 | |
| , | 1725 | |
| ? | 243 | 3.5% |
| / | 25 | 0.4% |
| * | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 295381 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 91 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 91 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5646928 | |
| Common | 302622 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 752064 | |
| a | 718314 | |
| e | 679639 | |
| n | 502391 | |
| i | 486974 | |
| d | 307154 | 5.4% |
| s | 291189 | 5.2% |
| S | 249895 | 4.4% |
| U | 243908 | 4.3% |
| o | 210105 | 3.7% |
| Other values (42) | 1205295 |
Common
| Value | Count | Frequency (%) |
| 295381 | ||
| ' | 3154 | 1.0% |
| . | 1894 | 0.6% |
| , | 1725 | 0.6% |
| ? | 243 | 0.1% |
| ( | 91 | < 0.1% |
| ) | 91 | < 0.1% |
| / | 25 | < 0.1% |
| - | 17 | < 0.1% |
| * | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5949549 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 752064 | |
| a | 718314 | |
| e | 679639 | |
| n | 502391 | 8.4% |
| i | 486974 | 8.2% |
| d | 307154 | 5.2% |
| 295381 | 5.0% | |
| s | 291189 | 4.9% |
| S | 249895 | 4.2% |
| U | 243908 | 4.1% |
| Other values (51) | 1422640 |
None
| Value | Count | Frequency (%) |
| ç | 1 |
stateProvince
Text
Missing 
| Distinct | 1750 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 93954 |
| Missing (%) | 15.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 27 |
| Mean length | 9.156487625 |
| Min length | 1 |
Unique
| Unique | 314 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Bocas Del Toro |
|---|---|
| 2nd row | Utah |
| 3rd row | Bolivar |
| 4th row | Oaxaca |
| 5th row | North Carolina |
| Value | Count | Frequency (%) |
| california | 37958 | 5.7% |
| new | 18698 | 2.8% |
| alaska | 18000 | 2.7% |
| oregon | 15112 | 2.3% |
| province | 15077 | 2.2% |
| arizona | 13072 | 1.9% |
| virginia | 12189 | 1.8% |
| washington | 12057 | 1.8% |
| texas | 11524 | 1.7% |
| mexico | 9875 | 1.5% |
| Other values (1720) | 507096 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 685721 | |
| i | 388351 | 8.4% |
| n | 356516 | 7.7% |
| o | 350614 | 7.5% |
| r | 326855 | 7.0% |
| e | 277944 | 6.0% |
| l | 192295 | 4.1% |
| s | 173201 | 3.7% |
| t | 172374 | 3.7% |
| 163161 | 3.5% | |
| Other values (65) | 1559858 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3782086 | |
| Uppercase Letter | 683335 | 14.7% |
| Space Separator | 163161 | 3.5% |
| Dash Punctuation | 15111 | 0.3% |
| Other Punctuation | 3190 | 0.1% |
| Decimal Number | 4 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 685721 | |
| i | 388351 | |
| n | 356516 | |
| o | 350614 | |
| r | 326855 | |
| e | 277944 | 7.3% |
| l | 192295 | 5.1% |
| s | 173201 | 4.6% |
| t | 172374 | 4.6% |
| u | 116650 | 3.1% |
| Other values (25) | 741565 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 96322 | |
| A | 66126 | 9.7% |
| N | 63963 | 9.4% |
| M | 54370 | 8.0% |
| S | 44892 | 6.6% |
| T | 39318 | 5.8% |
| P | 37886 | 5.5% |
| B | 35544 | 5.2% |
| W | 30828 | 4.5% |
| O | 27556 | 4.0% |
| Other values (16) | 186530 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 2998 | |
| ? | 159 | 5.0% |
| / | 21 | 0.7% |
| * | 6 | 0.2% |
| . | 5 | 0.2% |
| : | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 1 | |
| 8 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 163161 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15111 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4465421 | |
| Common | 181469 | 3.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 685721 | |
| i | 388351 | 8.7% |
| n | 356516 | 8.0% |
| o | 350614 | 7.9% |
| r | 326855 | 7.3% |
| e | 277944 | 6.2% |
| l | 192295 | 4.3% |
| s | 173201 | 3.9% |
| t | 172374 | 3.9% |
| u | 116650 | 2.6% |
| Other values (51) | 1424900 |
Common
| Value | Count | Frequency (%) |
| 163161 | ||
| - | 15111 | 8.3% |
| ' | 2998 | 1.7% |
| ? | 159 | 0.1% |
| / | 21 | < 0.1% |
| * | 6 | < 0.1% |
| . | 5 | < 0.1% |
| 1 | 2 | < 0.1% |
| 0 | 1 | < 0.1% |
| : | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4645873 | |
| None | 1017 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 685721 | |
| i | 388351 | 8.4% |
| n | 356516 | 7.7% |
| o | 350614 | 7.5% |
| r | 326855 | 7.0% |
| e | 277944 | 6.0% |
| l | 192295 | 4.1% |
| s | 173201 | 3.7% |
| t | 172374 | 3.7% |
| 163161 | 3.5% | |
| Other values (56) | 1558841 |
None
| Value | Count | Frequency (%) |
| é | 367 | |
| ó | 346 | |
| ä | 178 | |
| ê | 92 | 9.0% |
| ô | 30 | 2.9% |
| ç | 1 | 0.1% |
| ã | 1 | 0.1% |
| ō | 1 | 0.1% |
| æ | 1 | 0.1% |
county
Text
Missing 
| Distinct | 3194 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 447402 |
| Missing (%) | 74.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 47 |
|---|---|
| Median length | 27 |
| Mean length | 13.46725393 |
| Min length | 1 |
Unique
| Unique | 663 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Carteret |
|---|---|
| 2nd row | Cusco |
| 3rd row | Monterey County |
| 4th row | Galveston |
| 5th row | Tamana Ward |
| Value | Count | Frequency (%) |
| county | 80697 | |
| district | 13828 | 4.7% |
| islands | 3705 | 1.3% |
| division | 3460 | 1.2% |
| san | 3315 | 1.1% |
| province | 2619 | 0.9% |
| schoolcraft | 2179 | 0.7% |
| mackenzie | 1966 | 0.7% |
| lane | 1935 | 0.7% |
| municipality | 1862 | 0.6% |
| Other values (2969) | 178313 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 189818 | 9.1% |
| o | 175404 | 8.5% |
| t | 161467 | 7.8% |
| a | 160330 | 7.7% |
| 139830 | 6.7% | |
| i | 120188 | 5.8% |
| u | 116014 | 5.6% |
| e | 111686 | 5.4% |
| r | 102364 | 4.9% |
| C | 99007 | 4.8% |
| Other values (69) | 698509 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1630734 | |
| Uppercase Letter | 298270 | 14.4% |
| Space Separator | 139830 | 6.7% |
| Dash Punctuation | 4189 | 0.2% |
| Other Punctuation | 1555 | 0.1% |
| Close Punctuation | 13 | < 0.1% |
| Open Punctuation | 13 | < 0.1% |
| Decimal Number | 8 | < 0.1% |
| Modifier Letter | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 189818 | |
| o | 175404 | |
| t | 161467 | |
| a | 160330 | |
| i | 120188 | 7.4% |
| u | 116014 | 7.1% |
| e | 111686 | 6.8% |
| r | 102364 | 6.3% |
| y | 97639 | 6.0% |
| s | 76836 | 4.7% |
| Other values (28) | 318988 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 99007 | |
| D | 27665 | 9.3% |
| S | 18077 | 6.1% |
| M | 17795 | 6.0% |
| B | 15214 | 5.1% |
| P | 13875 | 4.7% |
| A | 12422 | 4.2% |
| L | 11112 | 3.7% |
| G | 10792 | 3.6% |
| W | 8980 | 3.0% |
| Other values (17) | 63331 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1171 | |
| . | 192 | 12.3% |
| * | 113 | 7.3% |
| ? | 56 | 3.6% |
| / | 21 | 1.4% |
| , | 2 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4185 | |
| – | 4 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 4 | 4 |
Space Separator
| Value | Count | Frequency (%) |
| 139830 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1929004 | |
| Common | 145613 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 189818 | 9.8% |
| o | 175404 | 9.1% |
| t | 161467 | 8.4% |
| a | 160330 | 8.3% |
| i | 120188 | 6.2% |
| u | 116014 | 6.0% |
| e | 111686 | 5.8% |
| r | 102364 | 5.3% |
| C | 99007 | 5.1% |
| y | 97639 | 5.1% |
| Other values (55) | 595087 |
Common
| Value | Count | Frequency (%) |
| 139830 | ||
| - | 4185 | 2.9% |
| ' | 1171 | 0.8% |
| . | 192 | 0.1% |
| * | 113 | 0.1% |
| ? | 56 | < 0.1% |
| / | 21 | < 0.1% |
| ) | 13 | < 0.1% |
| ( | 13 | < 0.1% |
| ʻ | 5 | < 0.1% |
| Other values (4) | 14 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2074120 | |
| None | 485 | < 0.1% |
| Modifier Letters | 5 | < 0.1% |
| Punctuation | 4 | < 0.1% |
| Latin Ext Additional | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 189818 | 9.2% |
| o | 175404 | 8.5% |
| t | 161467 | 7.8% |
| a | 160330 | 7.7% |
| 139830 | 6.7% | |
| i | 120188 | 5.8% |
| u | 116014 | 5.6% |
| e | 111686 | 5.4% |
| r | 102364 | 4.9% |
| C | 99007 | 4.8% |
| Other values (54) | 698012 |
None
| Value | Count | Frequency (%) |
| é | 197 | |
| í | 176 | |
| è | 57 | 11.8% |
| ô | 23 | 4.7% |
| ê | 12 | 2.5% |
| ū | 5 | 1.0% |
| ā | 4 | 0.8% |
| Đ | 3 | 0.6% |
| ơ | 3 | 0.6% |
| à | 3 | 0.6% |
| Other values (2) | 2 | 0.4% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 5 |
Punctuation
| Value | Count | Frequency (%) |
| – | 4 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ắ | 3 |
locality
Text
Missing 
| Distinct | 86656 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 35404 |
| Missing (%) | 5.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 294 |
|---|---|
| Median length | 159 |
| Mean length | 21.69044267 |
| Min length | 1 |
Unique
| Unique | 52764 ? |
|---|---|
| Unique (%) | 9.3% |
Sample
| 1st row | Tierra Oscura, 3.5 Km S. Tiger Key |
|---|---|
| 2nd row | Uinta Forest, Currant Creek |
| 3rd row | km. 125, 85 Km SSE El Dorado |
| 4th row | Totontepec |
| 5th row | Atlantic Beach, Atlantic Beach, 1/2 Mi E Of Triple S Pier. |
| Value | Count | Frequency (%) |
| km | 82857 | 3.9% |
| mi | 82389 | 3.8% |
| of | 34259 | 1.6% |
| n | 30440 | 1.4% |
| river | 28140 | 1.3% |
| s | 27057 | 1.3% |
| e | 26413 | 1.2% |
| w | 26172 | 1.2% |
| island | 23296 | 1.1% |
| san | 23251 | 1.1% |
| Other values (42744) | 1760837 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1579064 | 12.9% | |
| a | 1198873 | 9.8% |
| e | 766610 | 6.2% |
| i | 659790 | 5.4% |
| n | 655818 | 5.3% |
| o | 653029 | 5.3% |
| r | 550115 | 4.5% |
| l | 446951 | 3.6% |
| t | 434393 | 3.5% |
| , | 393002 | 3.2% |
| Other values (116) | 4940165 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7761808 | |
| Uppercase Letter | 2026861 | 16.5% |
| Space Separator | 1579064 | 12.9% |
| Other Punctuation | 489421 | 4.0% |
| Decimal Number | 361074 | 2.9% |
| Open Punctuation | 19801 | 0.2% |
| Close Punctuation | 19779 | 0.2% |
| Dash Punctuation | 15950 | 0.1% |
| Math Symbol | 3991 | < 0.1% |
| Connector Punctuation | 54 | < 0.1% |
| Other values (3) | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1198873 | |
| e | 766610 | |
| i | 659790 | 8.5% |
| n | 655818 | 8.4% |
| o | 653029 | 8.4% |
| r | 550115 | 7.1% |
| l | 446951 | 5.8% |
| t | 434393 | 5.6% |
| s | 353920 | 4.6% |
| u | 324066 | 4.2% |
| Other values (49) | 1718243 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 227459 | 11.2% |
| M | 200123 | 9.9% |
| C | 146981 | 7.3% |
| N | 141675 | 7.0% |
| K | 124320 | 6.1% |
| R | 117188 | 5.8% |
| B | 112739 | 5.6% |
| P | 108902 | 5.4% |
| E | 107643 | 5.3% |
| W | 98686 | 4.9% |
| Other values (21) | 641145 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 393002 | |
| . | 71840 | 14.7% |
| ; | 9568 | 2.0% |
| ' | 6996 | 1.4% |
| / | 2669 | 0.5% |
| : | 2390 | 0.5% |
| " | 1272 | 0.3% |
| ? | 612 | 0.1% |
| & | 491 | 0.1% |
| # | 388 | 0.1% |
| Other values (3) | 193 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 75042 | |
| 2 | 56929 | |
| 5 | 50938 | |
| 0 | 37299 | |
| 3 | 35827 | |
| 4 | 29576 | 8.2% |
| 6 | 25291 | 7.0% |
| 8 | 19038 | 5.3% |
| 7 | 17890 | 5.0% |
| 9 | 13244 | 3.7% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 3747 | |
| + | 184 | 4.6% |
| ~ | 60 | 1.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10013 | |
| [ | 9788 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 9993 | |
| ] | 9786 |
Space Separator
| Value | Count | Frequency (%) |
| 1579064 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15950 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 54 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 2 |
Other Number
| Value | Count | Frequency (%) |
| ¼ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9788654 | |
| Common | 2489141 | 20.3% |
| Cyrillic | 15 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1198873 | 12.2% |
| e | 766610 | 7.8% |
| i | 659790 | 6.7% |
| n | 655818 | 6.7% |
| o | 653029 | 6.7% |
| r | 550115 | 5.6% |
| l | 446951 | 4.6% |
| t | 434393 | 4.4% |
| s | 353920 | 3.6% |
| u | 324066 | 3.3% |
| Other values (68) | 3745089 |
Common
| Value | Count | Frequency (%) |
| 1579064 | ||
| , | 393002 | 15.8% |
| 1 | 75042 | 3.0% |
| . | 71840 | 2.9% |
| 2 | 56929 | 2.3% |
| 5 | 50938 | 2.0% |
| 0 | 37299 | 1.5% |
| 3 | 35827 | 1.4% |
| 4 | 29576 | 1.2% |
| 6 | 25291 | 1.0% |
| Other values (26) | 134333 | 5.4% |
Cyrillic
| Value | Count | Frequency (%) |
| л | 3 | |
| к | 2 | |
| т | 1 | 6.7% |
| і | 1 | 6.7% |
| ө | 1 | 6.7% |
| ы | 1 | 6.7% |
| а | 1 | 6.7% |
| м | 1 | 6.7% |
| н | 1 | 6.7% |
| е | 1 | 6.7% |
| Other values (2) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12277174 | |
| None | 619 | < 0.1% |
| Cyrillic | 15 | < 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1579064 | 12.9% | |
| a | 1198873 | 9.8% |
| e | 766610 | 6.2% |
| i | 659790 | 5.4% |
| n | 655818 | 5.3% |
| o | 653029 | 5.3% |
| r | 550115 | 4.5% |
| l | 446951 | 3.6% |
| t | 434393 | 3.5% |
| , | 393002 | 3.2% |
| Other values (75) | 4939529 |
None
| Value | Count | Frequency (%) |
| é | 382 | |
| è | 107 | 17.3% |
| ø | 19 | 3.1% |
| ñ | 19 | 3.1% |
| á | 11 | 1.8% |
| ö | 11 | 1.8% |
| ã | 7 | 1.1% |
| ü | 7 | 1.1% |
| ó | 7 | 1.1% |
| Œ | 6 | 1.0% |
| Other values (18) | 43 | 6.9% |
Cyrillic
| Value | Count | Frequency (%) |
| л | 3 | |
| к | 2 | |
| т | 1 | 6.7% |
| і | 1 | 6.7% |
| ө | 1 | 6.7% |
| ы | 1 | 6.7% |
| а | 1 | 6.7% |
| м | 1 | 6.7% |
| н | 1 | 6.7% |
| е | 1 | 6.7% |
| Other values (2) | 2 |
Punctuation
| Value | Count | Frequency (%) |
| › | 2 |
Missing 
| Distinct | 1508 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 496901 |
| Missing (%) | 82.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.297771401 |
| Min length | 3 |
Unique
| Unique | 432 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 1032.0 |
|---|---|
| 2nd row | 1006.0 |
| 3rd row | 545.0 |
| 4th row | 2134.0 |
| 5th row | 130.0 |
| Value | Count | Frequency (%) |
| 155.0 | 2555 | 2.4% |
| 150.0 | 2079 | 2.0% |
| 975.0 | 1931 | 1.8% |
| 1829.0 | 1925 | 1.8% |
| 1524.0 | 1732 | 1.7% |
| 1219.0 | 1705 | 1.6% |
| 2438.0 | 1490 | 1.4% |
| 2134.0 | 1369 | 1.3% |
| 914.0 | 1339 | 1.3% |
| 610.0 | 1184 | 1.1% |
| Other values (1495) | 87241 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 160391 | |
| . | 104550 | |
| 1 | 64341 | |
| 2 | 42599 | 7.7% |
| 5 | 40675 | 7.3% |
| 3 | 28323 | 5.1% |
| 4 | 25625 | 4.6% |
| 7 | 24670 | 4.5% |
| 9 | 21989 | 4.0% |
| 6 | 20905 | 3.8% |
| Other values (2) | 19814 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 449325 | |
| Other Punctuation | 104550 | 18.9% |
| Dash Punctuation | 7 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 160391 | |
| 1 | 64341 | |
| 2 | 42599 | 9.5% |
| 5 | 40675 | 9.1% |
| 3 | 28323 | 6.3% |
| 4 | 25625 | 5.7% |
| 7 | 24670 | 5.5% |
| 9 | 21989 | 4.9% |
| 6 | 20905 | 4.7% |
| 8 | 19807 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 104550 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 553882 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 160391 | |
| . | 104550 | |
| 1 | 64341 | |
| 2 | 42599 | 7.7% |
| 5 | 40675 | 7.3% |
| 3 | 28323 | 5.1% |
| 4 | 25625 | 4.6% |
| 7 | 24670 | 4.5% |
| 9 | 21989 | 4.0% |
| 6 | 20905 | 3.8% |
| Other values (2) | 19814 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 553882 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 160391 | |
| . | 104550 | |
| 1 | 64341 | |
| 2 | 42599 | 7.7% |
| 5 | 40675 | 7.3% |
| 3 | 28323 | 5.1% |
| 4 | 25625 | 4.6% |
| 7 | 24670 | 4.5% |
| 9 | 21989 | 4.0% |
| 6 | 20905 | 3.8% |
| Other values (2) | 19814 | 3.6% |
Missing 
| Distinct | 115 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 597572 |
| Missing (%) | 99.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.129156999 |
| Min length | 3 |
Unique
| Unique | 27 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 1951.0 |
|---|---|
| 2nd row | 2835.0 |
| 3rd row | 61.0 |
| 4th row | 2200.0 |
| 5th row | 1500.0 |
| Value | Count | Frequency (%) |
| 76.0 | 652 | |
| 1500.0 | 427 | 11.0% |
| 152.0 | 278 | 7.2% |
| 914.0 | 240 | 6.2% |
| 2200.0 | 237 | 6.1% |
| 30.0 | 175 | 4.5% |
| 2010.0 | 156 | 4.0% |
| 488.0 | 143 | 3.7% |
| 400.0 | 138 | 3.6% |
| 305.0 | 120 | 3.1% |
| Other values (105) | 1313 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6753 | |
| . | 3879 | |
| 1 | 1956 | 9.8% |
| 2 | 1621 | 8.1% |
| 5 | 1289 | 6.5% |
| 6 | 978 | 4.9% |
| 7 | 921 | 4.6% |
| 4 | 876 | 4.4% |
| 3 | 675 | 3.4% |
| 8 | 516 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16017 | |
| Other Punctuation | 3879 | 19.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6753 | |
| 1 | 1956 | 12.2% |
| 2 | 1621 | 10.1% |
| 5 | 1289 | 8.0% |
| 6 | 978 | 6.1% |
| 7 | 921 | 5.8% |
| 4 | 876 | 5.5% |
| 3 | 675 | 4.2% |
| 8 | 516 | 3.2% |
| 9 | 432 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3879 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19896 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6753 | |
| . | 3879 | |
| 1 | 1956 | 9.8% |
| 2 | 1621 | 8.1% |
| 5 | 1289 | 6.5% |
| 6 | 978 | 4.9% |
| 7 | 921 | 4.6% |
| 4 | 876 | 4.4% |
| 3 | 675 | 3.4% |
| 8 | 516 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19896 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6753 | |
| . | 3879 | |
| 1 | 1956 | 9.8% |
| 2 | 1621 | 8.1% |
| 5 | 1289 | 6.5% |
| 6 | 978 | 4.9% |
| 7 | 921 | 4.6% |
| 4 | 876 | 4.4% |
| 3 | 675 | 3.4% |
| 8 | 516 | 2.6% |
Missing 
| Distinct | 29 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 599861 |
| Missing (%) | 99.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 8 |
| Mean length | 8.518867925 |
| Min length | 2 |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | sea level |
|---|---|
| 2nd row | sealevel |
| 3rd row | sealevel |
| 4th row | sealevel |
| 5th row | see Osgood 1909:214 |
| Value | Count | Frequency (%) |
| sealevel | 1096 | |
| sea | 280 | 12.0% |
| level | 277 | 11.9% |
| ft | 143 | 6.1% |
| 104 | 4.5% | |
| 100 | 81 | 3.5% |
| m | 59 | 2.5% |
| near | 32 | 1.4% |
| below | 30 | 1.3% |
| 3 | 28 | 1.2% |
| Other values (33) | 206 | 8.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4198 | |
| l | 2792 | |
| a | 1481 | 10.9% |
| s | 1380 | 10.2% |
| v | 1376 | 10.2% |
| 746 | 5.5% | |
| 0 | 314 | 2.3% |
| t | 156 | 1.2% |
| 1 | 152 | 1.1% |
| f | 143 | 1.1% |
| Other values (33) | 807 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12018 | |
| Space Separator | 746 | 5.5% |
| Decimal Number | 555 | 4.1% |
| Math Symbol | 110 | 0.8% |
| Uppercase Letter | 87 | 0.6% |
| Dash Punctuation | 22 | 0.2% |
| Other Punctuation | 5 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4198 | |
| l | 2792 | |
| a | 1481 | 12.3% |
| s | 1380 | 11.5% |
| v | 1376 | 11.4% |
| t | 156 | 1.3% |
| f | 143 | 1.2% |
| c | 92 | 0.8% |
| m | 62 | 0.5% |
| r | 61 | 0.5% |
| Other values (12) | 277 | 2.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 314 | |
| 1 | 152 | |
| 3 | 52 | 9.4% |
| 5 | 16 | 2.9% |
| 2 | 10 | 1.8% |
| 7 | 6 | 1.1% |
| 9 | 3 | 0.5% |
| 4 | 2 | 0.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 28 | |
| G | 28 | |
| S | 28 | |
| M | 1 | 1.1% |
| K | 1 | 1.1% |
| O | 1 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| : | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 746 |
Math Symbol
| Value | Count | Frequency (%) |
| < | 110 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 22 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12105 | |
| Common | 1440 | 10.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4198 | |
| l | 2792 | |
| a | 1481 | 12.2% |
| s | 1380 | 11.4% |
| v | 1376 | 11.4% |
| t | 156 | 1.3% |
| f | 143 | 1.2% |
| c | 92 | 0.8% |
| m | 62 | 0.5% |
| r | 61 | 0.5% |
| Other values (18) | 364 | 3.0% |
Common
| Value | Count | Frequency (%) |
| 746 | ||
| 0 | 314 | |
| 1 | 152 | 10.6% |
| < | 110 | 7.6% |
| 3 | 52 | 3.6% |
| - | 22 | 1.5% |
| 5 | 16 | 1.1% |
| 2 | 10 | 0.7% |
| 7 | 6 | 0.4% |
| 9 | 3 | 0.2% |
| Other values (5) | 9 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13545 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4198 | |
| l | 2792 | |
| a | 1481 | 10.9% |
| s | 1380 | 10.2% |
| v | 1376 | 10.2% |
| 746 | 5.5% | |
| 0 | 314 | 2.3% |
| t | 156 | 1.2% |
| 1 | 152 | 1.1% |
| f | 143 | 1.1% |
| Other values (33) | 807 | 6.0% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 601448 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.666666667 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | 853.0 |
|---|---|
| 2nd row | 1600.0 |
| 3rd row | 1600.0 |
| Value | Count | Frequency (%) |
| 1600.0 | 2 | |
| 853.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 3 | |
| 1 | 2 | 11.8% |
| 6 | 2 | 11.8% |
| 8 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 3 | 1 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 | |
| Other Punctuation | 3 | 17.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7 | |
| 1 | 2 | 14.3% |
| 6 | 2 | 14.3% |
| 8 | 1 | 7.1% |
| 5 | 1 | 7.1% |
| 3 | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 3 | |
| 1 | 2 | 11.8% |
| 6 | 2 | 11.8% |
| 8 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 3 | 1 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 3 | |
| 1 | 2 | 11.8% |
| 6 | 2 | 11.8% |
| 8 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 3 | 1 | 5.9% |
decimalLatitude
Text
Missing 
| Distinct | 10264 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 448433 |
| Missing (%) | 74.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 5.04639977 |
| Min length | 3 |
Unique
| Unique | 4985 ? |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | 5.98 |
|---|---|
| 2nd row | 34.68 |
| 3rd row | 31.5011 |
| 4th row | 29.37 |
| 5th row | 34.4863 |
| Value | Count | Frequency (%) |
| 5.3 | 1716 | 1.1% |
| 2.78 | 1090 | 0.7% |
| 5.67 | 1073 | 0.7% |
| 0.88 | 979 | 0.6% |
| 3.65 | 946 | 0.6% |
| 8.83 | 814 | 0.5% |
| 10.53 | 811 | 0.5% |
| 3.17 | 798 | 0.5% |
| 8.17 | 759 | 0.5% |
| 7.32 | 742 | 0.5% |
| Other values (9276) | 143290 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 153018 | |
| 3 | 85567 | |
| 2 | 76884 | |
| 1 | 68690 | |
| 5 | 67202 | |
| 8 | 61499 | |
| 7 | 57931 | 7.5% |
| 6 | 42374 | 5.5% |
| 0 | 42282 | 5.5% |
| 9 | 41004 | 5.3% |
| Other values (2) | 75739 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 581467 | |
| Other Punctuation | 153018 | 19.8% |
| Dash Punctuation | 37705 | 4.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 85567 | |
| 2 | 76884 | |
| 1 | 68690 | |
| 5 | 67202 | |
| 8 | 61499 | |
| 7 | 57931 | |
| 6 | 42374 | |
| 0 | 42282 | |
| 9 | 41004 | |
| 4 | 38034 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 153018 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37705 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 772190 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 153018 | |
| 3 | 85567 | |
| 2 | 76884 | |
| 1 | 68690 | |
| 5 | 67202 | |
| 8 | 61499 | |
| 7 | 57931 | 7.5% |
| 6 | 42374 | 5.5% |
| 0 | 42282 | 5.5% |
| 9 | 41004 | 5.3% |
| Other values (2) | 75739 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 772190 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 153018 | |
| 3 | 85567 | |
| 2 | 76884 | |
| 1 | 68690 | |
| 5 | 67202 | |
| 8 | 61499 | |
| 7 | 57931 | 7.5% |
| 6 | 42374 | 5.5% |
| 0 | 42282 | 5.5% |
| 9 | 41004 | 5.3% |
| Other values (2) | 75739 |
decimalLongitude
Text
Missing 
| Distinct | 11880 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 448433 |
| Missing (%) | 74.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 5.651550798 |
| Min length | 3 |
Unique
| Unique | 5910 ? |
|---|---|
| Unique (%) | 3.9% |
Sample
| 1st row | -61.43 |
|---|---|
| 2nd row | -76.7 |
| 3rd row | 65.8453 |
| 4th row | -94.82 |
| 5th row | 74.6026 |
| Value | Count | Frequency (%) |
| 66.22 | 1723 | 1.1% |
| 16.42 | 1090 | 0.7% |
| 127.68 | 955 | 0.6% |
| 0.2 | 930 | 0.6% |
| 70.5 | 790 | 0.5% |
| 71.95 | 739 | 0.5% |
| 79.62 | 722 | 0.5% |
| 0.22 | 681 | 0.4% |
| 0.97 | 651 | 0.4% |
| 66.18 | 629 | 0.4% |
| Other values (11070) | 144108 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 153018 | |
| - | 87916 | |
| 2 | 86605 | |
| 1 | 81267 | |
| 7 | 80986 | |
| 3 | 68421 | |
| 6 | 62202 | |
| 8 | 58615 | 6.8% |
| 5 | 58531 | 6.8% |
| 0 | 50551 | 5.8% |
| Other values (2) | 76677 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 623855 | |
| Other Punctuation | 153018 | 17.7% |
| Dash Punctuation | 87916 | 10.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 86605 | |
| 1 | 81267 | |
| 7 | 80986 | |
| 3 | 68421 | |
| 6 | 62202 | |
| 8 | 58615 | |
| 5 | 58531 | |
| 0 | 50551 | |
| 4 | 38663 | |
| 9 | 38014 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 153018 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 87916 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 864789 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 153018 | |
| - | 87916 | |
| 2 | 86605 | |
| 1 | 81267 | |
| 7 | 80986 | |
| 3 | 68421 | |
| 6 | 62202 | |
| 8 | 58615 | 6.8% |
| 5 | 58531 | 6.8% |
| 0 | 50551 | 5.8% |
| Other values (2) | 76677 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 864789 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 153018 | |
| - | 87916 | |
| 2 | 86605 | |
| 1 | 81267 | |
| 7 | 80986 | |
| 3 | 68421 | |
| 6 | 62202 | |
| 8 | 58615 | 6.8% |
| 5 | 58531 | 6.8% |
| 0 | 50551 | 5.8% |
| Other values (2) | 76677 |
geodeticDatum
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 594543 |
| Missing (%) | 98.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 17.99681529 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | WGS 84 (EPSG:4326) |
|---|---|
| 2nd row | WGS 84 (EPSG:4326) |
| 3rd row | WGS 84 (EPSG:4326) |
| 4th row | WGS 84 (EPSG:4326) |
| 5th row | WGS 84 (EPSG:4326) |
| Value | Count | Frequency (%) |
| wgs | 6906 | |
| 84 | 6906 | |
| epsg:4326 | 6906 | |
| unknown | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 13812 | |
| S | 13812 | |
| 13812 | ||
| 4 | 13812 | |
| W | 6906 | 5.6% |
| ) | 6906 | 5.6% |
| 6 | 6906 | 5.6% |
| 2 | 6906 | 5.6% |
| 3 | 6906 | 5.6% |
| : | 6906 | 5.6% |
| Other values (9) | 27638 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 48342 | |
| Decimal Number | 41436 | |
| Space Separator | 13812 | 11.1% |
| Close Punctuation | 6906 | 5.6% |
| Other Punctuation | 6906 | 5.6% |
| Open Punctuation | 6906 | 5.6% |
| Lowercase Letter | 14 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 13812 | |
| S | 13812 | |
| W | 6906 | |
| P | 6906 | |
| E | 6906 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 13812 | |
| 6 | 6906 | |
| 2 | 6906 | |
| 3 | 6906 | |
| 8 | 6906 |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 6 | |
| u | 2 | 14.3% |
| k | 2 | 14.3% |
| o | 2 | 14.3% |
| w | 2 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| 13812 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6906 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 6906 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6906 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 75966 | |
| Latin | 48356 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 13812 | |
| S | 13812 | |
| W | 6906 | |
| P | 6906 | |
| E | 6906 | |
| n | 6 | < 0.1% |
| u | 2 | < 0.1% |
| k | 2 | < 0.1% |
| o | 2 | < 0.1% |
| w | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 13812 | ||
| 4 | 13812 | |
| ) | 6906 | |
| 6 | 6906 | |
| 2 | 6906 | |
| 3 | 6906 | |
| : | 6906 | |
| ( | 6906 | |
| 8 | 6906 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 124322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 13812 | |
| S | 13812 | |
| 13812 | ||
| 4 | 13812 | |
| W | 6906 | 5.6% |
| ) | 6906 | 5.6% |
| 6 | 6906 | 5.6% |
| 2 | 6906 | 5.6% |
| 3 | 6906 | 5.6% |
| : | 6906 | 5.6% |
| Other values (9) | 27638 |
verbatimLatitude
Text
Missing 
| Distinct | 11921 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 466631 |
| Missing (%) | 77.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 10 |
| Mean length | 9.74341344 |
| Min length | 3 |
Unique
| Unique | 6353 ? |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | 05 59 -- N |
|---|---|
| 2nd row | 34 41 4- N |
| 3rd row | 29 22 1- N |
| 4th row | 02 37 -- N |
| 5th row | 28 39 -- S |
| Value | Count | Frequency (%) |
| 106698 | ||
| n | 93430 | |
| s | 28456 | 5.7% |
| 10 | 13100 | 2.6% |
| 09 | 10526 | 2.1% |
| 08 | 8990 | 1.8% |
| 05 | 8987 | 1.8% |
| 07 | 8164 | 1.6% |
| 30 | 8001 | 1.6% |
| 06 | 7215 | 1.4% |
| Other values (2490) | 205546 |
Most occurring characters
| Value | Count | Frequency (%) |
| 364293 | ||
| - | 242090 | |
| 0 | 121688 | 9.3% |
| N | 102689 | 7.8% |
| 1 | 82952 | 6.3% |
| 2 | 72522 | 5.5% |
| 3 | 69271 | 5.3% |
| 5 | 58613 | 4.5% |
| 4 | 50109 | 3.8% |
| 9 | 32877 | 2.5% |
| Other values (21) | 116503 | 8.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 568428 | |
| Space Separator | 364293 | |
| Dash Punctuation | 242090 | |
| Uppercase Letter | 134294 | 10.2% |
| Other Punctuation | 3746 | 0.3% |
| Lowercase Letter | 412 | < 0.1% |
| Other Symbol | 344 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 121688 | |
| 1 | 82952 | |
| 2 | 72522 | |
| 3 | 69271 | |
| 5 | 58613 | |
| 4 | 50109 | |
| 9 | 32877 | 5.8% |
| 8 | 28469 | 5.0% |
| 7 | 27206 | 4.8% |
| 6 | 24721 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 102689 | |
| S | 31520 | 23.5% |
| W | 74 | 0.1% |
| E | 4 | < 0.1% |
| M | 3 | < 0.1% |
| O | 3 | < 0.1% |
| A | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2935 | |
| ' | 801 | 21.4% |
| ? | 6 | 0.2% |
| * | 2 | 0.1% |
| ; | 1 | < 0.1% |
| " | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 136 | |
| g | 136 | |
| e | 136 | |
| c | 2 | 0.5% |
| a | 2 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 364293 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 242090 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 344 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1178901 | |
| Latin | 134706 | 10.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 364293 | ||
| - | 242090 | |
| 0 | 121688 | 10.3% |
| 1 | 82952 | 7.0% |
| 2 | 72522 | 6.2% |
| 3 | 69271 | 5.9% |
| 5 | 58613 | 5.0% |
| 4 | 50109 | 4.3% |
| 9 | 32877 | 2.8% |
| 8 | 28469 | 2.4% |
| Other values (9) | 56017 | 4.8% |
Latin
| Value | Count | Frequency (%) |
| N | 102689 | |
| S | 31520 | 23.4% |
| d | 136 | 0.1% |
| g | 136 | 0.1% |
| e | 136 | 0.1% |
| W | 74 | 0.1% |
| E | 4 | < 0.1% |
| M | 3 | < 0.1% |
| O | 3 | < 0.1% |
| c | 2 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1313263 | |
| None | 344 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 364293 | ||
| - | 242090 | |
| 0 | 121688 | 9.3% |
| N | 102689 | 7.8% |
| 1 | 82952 | 6.3% |
| 2 | 72522 | 5.5% |
| 3 | 69271 | 5.3% |
| 5 | 58613 | 4.5% |
| 4 | 50109 | 3.8% |
| 9 | 32877 | 2.5% |
| Other values (20) | 116159 | 8.8% |
None
| Value | Count | Frequency (%) |
| ° | 344 |
Missing 
| Distinct | 13154 |
|---|---|
| Distinct (%) | 9.8% |
| Missing | 466723 |
| Missing (%) | 77.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 11 |
| Mean length | 10.73756012 |
| Min length | 3 |
Unique
| Unique | 7533 ? |
|---|---|
| Unique (%) | 5.6% |
Sample
| 1st row | 061 26 -- W |
|---|---|
| 2nd row | 076 42 1- W |
| 3rd row | 094 49 4- W |
| 4th row | 066 19 -- W |
| 5th row | 020 15 -- E |
| Value | Count | Frequency (%) |
| 106940 | ||
| w | 73770 | 14.8% |
| e | 47858 | 9.6% |
| 000 | 6910 | 1.4% |
| 00 | 4768 | 1.0% |
| 46 | 4542 | 0.9% |
| 002 | 4510 | 0.9% |
| 13 | 4306 | 0.9% |
| 001 | 3732 | 0.7% |
| 066 | 3560 | 0.7% |
| Other values (2805) | 238004 |
Most occurring characters
| Value | Count | Frequency (%) |
| 364172 | ||
| - | 243702 | |
| 0 | 216851 | |
| 1 | 88638 | 6.1% |
| W | 80745 | 5.6% |
| 2 | 78239 | 5.4% |
| 3 | 58623 | 4.1% |
| 5 | 53520 | 3.7% |
| E | 53220 | 3.7% |
| 4 | 52072 | 3.6% |
| Other values (18) | 156868 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 700323 | |
| Space Separator | 364172 | |
| Dash Punctuation | 243702 | 16.8% |
| Uppercase Letter | 133992 | 9.3% |
| Other Punctuation | 3709 | 0.3% |
| Lowercase Letter | 408 | < 0.1% |
| Other Symbol | 344 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 216851 | |
| 1 | 88638 | |
| 2 | 78239 | 11.2% |
| 3 | 58623 | 8.4% |
| 5 | 53520 | 7.6% |
| 4 | 52072 | 7.4% |
| 6 | 50424 | 7.2% |
| 7 | 43836 | 6.3% |
| 8 | 32353 | 4.6% |
| 9 | 25767 | 3.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 80745 | |
| E | 53220 | |
| N | 16 | < 0.1% |
| S | 9 | < 0.1% |
| O | 1 | < 0.1% |
| C | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2897 | |
| ' | 801 | 21.6% |
| ? | 7 | 0.2% |
| * | 2 | 0.1% |
| " | 1 | < 0.1% |
| ; | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 136 | |
| e | 136 | |
| g | 136 |
Space Separator
| Value | Count | Frequency (%) |
| 364172 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 243702 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 344 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1312250 | |
| Latin | 134400 | 9.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 364172 | ||
| - | 243702 | |
| 0 | 216851 | |
| 1 | 88638 | 6.8% |
| 2 | 78239 | 6.0% |
| 3 | 58623 | 4.5% |
| 5 | 53520 | 4.1% |
| 4 | 52072 | 4.0% |
| 6 | 50424 | 3.8% |
| 7 | 43836 | 3.3% |
| Other values (9) | 62173 | 4.7% |
Latin
| Value | Count | Frequency (%) |
| W | 80745 | |
| E | 53220 | |
| d | 136 | 0.1% |
| e | 136 | 0.1% |
| g | 136 | 0.1% |
| N | 16 | < 0.1% |
| S | 9 | < 0.1% |
| O | 1 | < 0.1% |
| C | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1446306 | |
| None | 344 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 364172 | ||
| - | 243702 | |
| 0 | 216851 | |
| 1 | 88638 | 6.1% |
| W | 80745 | 5.6% |
| 2 | 78239 | 5.4% |
| 3 | 58623 | 4.1% |
| 5 | 53520 | 3.7% |
| E | 53220 | 3.7% |
| 4 | 52072 | 3.6% |
| Other values (17) | 156524 |
None
| Value | Count | Frequency (%) |
| ° | 344 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 468202 |
| Missing (%) | 77.8% |
| Memory size | 4.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.96475771 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 133004 | |
| minutes | 133003 | |
| seconds | 133003 | |
| utm | 192 | < 0.1% |
| unknown | 53 | < 0.1% |
| decimal | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 665019 | |
| s | 399010 | |
| n | 266165 | 8.7% |
| 266007 | 8.7% | |
| M | 133195 | 4.4% |
| o | 133056 | 4.3% |
| D | 133004 | 4.3% |
| c | 133004 | 4.3% |
| g | 133004 | 4.3% |
| r | 133004 | 4.3% |
| Other values (12) | 665563 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2394385 | |
| Uppercase Letter | 399639 | 13.1% |
| Space Separator | 266007 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 665019 | |
| s | 399010 | |
| n | 266165 | |
| o | 133056 | 5.6% |
| c | 133004 | 5.6% |
| g | 133004 | 5.6% |
| r | 133004 | 5.6% |
| i | 133004 | 5.6% |
| d | 133004 | 5.6% |
| t | 133003 | 5.6% |
| Other values (6) | 133112 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 133195 | |
| D | 133004 | |
| S | 133003 | |
| U | 245 | 0.1% |
| T | 192 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 266007 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2794024 | |
| Common | 266007 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 665019 | |
| s | 399010 | |
| n | 266165 | |
| M | 133195 | 4.8% |
| o | 133056 | 4.8% |
| D | 133004 | 4.8% |
| c | 133004 | 4.8% |
| g | 133004 | 4.8% |
| r | 133004 | 4.8% |
| i | 133004 | 4.8% |
| Other values (11) | 532559 |
Common
| Value | Count | Frequency (%) |
| 266007 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3060031 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 665019 | |
| s | 399010 | |
| n | 266165 | 8.7% |
| 266007 | 8.7% | |
| M | 133195 | 4.4% |
| o | 133056 | 4.3% |
| D | 133004 | 4.3% |
| c | 133004 | 4.3% |
| g | 133004 | 4.3% |
| r | 133004 | 4.3% |
| Other values (12) | 665563 |
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 592196 |
| Missing (%) | 98.5% |
| Memory size | 4.6 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 12 |
| Mean length | 10.66731496 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Google Earth |
|---|---|
| 2nd row | Google Earth |
| 3rd row | GPS |
| 4th row | Google Earth |
| 5th row | Google Earth |
| Value | Count | Frequency (%) |
| 7074 | ||
| earth | 7074 | |
| gps | 1418 | 8.3% |
| usgs | 530 | 3.1% |
| topoview | 530 | 3.1% |
| gazetteer | 137 | 0.8% |
| atlas | 42 | 0.2% |
| of | 42 | 0.2% |
| canada | 42 | 0.2% |
| 42 | 0.2% | |
| Other values (4) | 96 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 15334 | |
| G | 9159 | |
| e | 8096 | |
| t | 8000 | |
| 7772 | ||
| a | 7479 | |
| r | 7294 | |
| l | 7116 | |
| h | 7076 | |
| g | 7074 | |
| Other values (22) | 14326 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 69543 | |
| Uppercase Letter | 21368 | 21.6% |
| Space Separator | 7772 | 7.9% |
| Dash Punctuation | 42 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 15334 | |
| e | 8096 | |
| t | 8000 | |
| a | 7479 | |
| r | 7294 | |
| l | 7116 | |
| h | 7076 | |
| g | 7074 | |
| p | 586 | 0.8% |
| w | 530 | 0.8% |
| Other values (8) | 958 | 1.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 9159 | |
| E | 7074 | |
| S | 2478 | 11.6% |
| P | 1418 | 6.6% |
| V | 530 | 2.5% |
| U | 530 | 2.5% |
| A | 42 | 0.2% |
| C | 42 | 0.2% |
| T | 42 | 0.2% |
| I | 39 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 7772 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 42 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 90911 | |
| Common | 7815 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 15334 | |
| G | 9159 | |
| e | 8096 | |
| t | 8000 | |
| a | 7479 | |
| r | 7294 | |
| l | 7116 | |
| h | 7076 | |
| g | 7074 | |
| E | 7074 | |
| Other values (19) | 7209 |
Common
| Value | Count | Frequency (%) |
| 7772 | ||
| - | 42 | 0.5% |
| . | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 98726 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 15334 | |
| G | 9159 | |
| e | 8096 | |
| t | 8000 | |
| 7772 | ||
| a | 7479 | |
| r | 7294 | |
| l | 7116 | |
| h | 7076 | |
| g | 7074 | |
| Other values (22) | 14326 |
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 11.8% |
| Missing | 601383 |
| Missing (%) | > 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 35 |
| Mean length | 31.20588235 |
| Min length | 5 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 5.9% |
Sample
| 1st row | Garmin Etrex Vista HCX, Datum WGS84 |
|---|---|
| 2nd row | Garmin Etrex Vista HCX, Datum WGS84 |
| 3rd row | Garmin Etrex Vista HCX, Datum WGS84 |
| 4th row | Garmin Etrex Vista HCX, Datum WGS84 |
| 5th row | Garmin Etrex Vista HCX, Datum WGS84 |
| Value | Count | Frequency (%) |
| garmin | 54 | |
| etrex | 54 | |
| vista | 54 | |
| hcx | 54 | |
| datum | 54 | |
| wgs84 | 54 | |
| camp | 7 | 2.0% |
| coordinates | 7 | 2.0% |
| for | 6 | 1.7% |
| longitude | 2 | 0.6% |
| Other values (7) | 12 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 290 | 13.7% | |
| a | 184 | 8.7% |
| t | 175 | 8.2% |
| r | 132 | 6.2% |
| i | 123 | 5.8% |
| m | 118 | 5.6% |
| G | 108 | 5.1% |
| e | 73 | 3.4% |
| n | 67 | 3.2% |
| s | 62 | 2.9% |
| Other values (24) | 790 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1118 | |
| Uppercase Letter | 551 | |
| Space Separator | 290 | 13.7% |
| Decimal Number | 108 | 5.1% |
| Other Punctuation | 55 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 184 | |
| t | 175 | |
| r | 132 | |
| i | 123 | |
| m | 118 | |
| e | 73 | 6.5% |
| n | 67 | 6.0% |
| s | 62 | 5.5% |
| u | 58 | 5.2% |
| x | 56 | 5.0% |
| Other values (8) | 70 | 6.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 108 | |
| C | 61 | |
| S | 54 | |
| W | 54 | |
| D | 54 | |
| X | 54 | |
| H | 54 | |
| V | 54 | |
| E | 54 | |
| L | 2 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 54 | |
| 8 | 54 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 54 | |
| ; | 1 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 290 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1669 | |
| Common | 453 | 21.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 184 | 11.0% |
| t | 175 | 10.5% |
| r | 132 | 7.9% |
| i | 123 | 7.4% |
| m | 118 | 7.1% |
| G | 108 | 6.5% |
| e | 73 | 4.4% |
| n | 67 | 4.0% |
| s | 62 | 3.7% |
| C | 61 | 3.7% |
| Other values (19) | 566 |
Common
| Value | Count | Frequency (%) |
| 290 | ||
| 4 | 54 | 11.9% |
| 8 | 54 | 11.9% |
| , | 54 | 11.9% |
| ; | 1 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2122 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 290 | 13.7% | |
| a | 184 | 8.7% |
| t | 175 | 8.2% |
| r | 132 | 6.2% |
| i | 123 | 5.8% |
| m | 118 | 5.6% |
| G | 108 | 5.1% |
| e | 73 | 3.4% |
| n | 67 | 3.2% |
| s | 62 | 2.9% |
| Other values (24) | 790 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 599947 |
| Missing (%) | 99.7% |
| Memory size | 4.6 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.412234043 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | uncertain |
|---|---|
| 2nd row | uncertain |
| 3rd row | uncertain |
| 4th row | uncertain |
| 5th row | cf. |
| Value | Count | Frequency (%) |
| uncertain | 1355 | |
| cf | 147 | 9.8% |
| sp | 2 | 0.1% |
| near | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 2712 | |
| c | 1502 | |
| e | 1357 | |
| r | 1357 | |
| a | 1357 | |
| t | 1355 | |
| i | 1355 | |
| u | 1315 | |
| . | 149 | 1.2% |
| f | 147 | 1.2% |
| Other values (4) | 46 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12461 | |
| Other Punctuation | 149 | 1.2% |
| Uppercase Letter | 40 | 0.3% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2712 | |
| c | 1502 | |
| e | 1357 | |
| r | 1357 | |
| a | 1357 | |
| t | 1355 | |
| i | 1355 | |
| u | 1315 | |
| f | 147 | 1.2% |
| s | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 149 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 40 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12501 | |
| Common | 151 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 2712 | |
| c | 1502 | |
| e | 1357 | |
| r | 1357 | |
| a | 1357 | |
| t | 1355 | |
| i | 1355 | |
| u | 1315 | |
| f | 147 | 1.2% |
| U | 40 | 0.3% |
| Other values (2) | 4 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| . | 149 | |
| 2 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12652 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 2712 | |
| c | 1502 | |
| e | 1357 | |
| r | 1357 | |
| a | 1357 | |
| t | 1355 | |
| i | 1355 | |
| u | 1315 | |
| . | 149 | 1.2% |
| f | 147 | 1.2% |
| Other values (4) | 46 | 0.4% |
typeStatus
Text
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 597685 |
| Missing (%) | 99.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 4 |
| Mean length | 4.250929368 |
| Min length | 4 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Lectotype |
|---|---|
| 2nd row | Type |
| 3rd row | Type |
| 4th row | Type |
| 5th row | Type |
| Value | Count | Frequency (%) |
| type | 3590 | |
| syntype | 83 | 2.2% |
| lectotype | 68 | 1.8% |
| renamed | 28 | 0.7% |
| neotype | 12 | 0.3% |
| holotype | 12 | 0.3% |
| nomen | 2 | 0.1% |
| nudem | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3905 | |
| y | 3848 | |
| p | 3765 | |
| T | 3590 | |
| t | 243 | 1.5% |
| n | 113 | 0.7% |
| o | 106 | 0.7% |
| S | 83 | 0.5% |
| L | 68 | 0.4% |
| c | 68 | 0.4% |
| Other values (10) | 220 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12152 | |
| Uppercase Letter | 3797 | 23.7% |
| Space Separator | 31 | 0.2% |
| Other Punctuation | 29 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3905 | |
| y | 3848 | |
| p | 3765 | |
| t | 243 | 2.0% |
| n | 113 | 0.9% |
| o | 106 | 0.9% |
| c | 68 | 0.6% |
| m | 32 | 0.3% |
| d | 30 | 0.2% |
| a | 28 | 0.2% |
| Other values (2) | 14 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 3590 | |
| S | 83 | 2.2% |
| L | 68 | 1.8% |
| R | 28 | 0.7% |
| N | 16 | 0.4% |
| H | 12 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 31 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 29 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15949 | |
| Common | 60 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3905 | |
| y | 3848 | |
| p | 3765 | |
| T | 3590 | |
| t | 243 | 1.5% |
| n | 113 | 0.7% |
| o | 106 | 0.7% |
| S | 83 | 0.5% |
| L | 68 | 0.4% |
| c | 68 | 0.4% |
| Other values (8) | 160 | 1.0% |
Common
| Value | Count | Frequency (%) |
| 31 | ||
| ; | 29 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16009 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3905 | |
| y | 3848 | |
| p | 3765 | |
| T | 3590 | |
| t | 243 | 1.5% |
| n | 113 | 0.7% |
| o | 106 | 0.7% |
| S | 83 | 0.5% |
| L | 68 | 0.4% |
| c | 68 | 0.4% |
| Other values (10) | 220 | 1.4% |
identifiedBy
Text
Missing 
| Distinct | 95 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 593267 |
| Missing (%) | 98.6% |
| Memory size | 4.6 MiB |
Length
| Max length | 132 |
|---|---|
| Median length | 124 |
| Mean length | 94.36840176 |
| Min length | 10 |
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | O'Neill, Jennifer K., Fort Hayes State University |
|---|---|
| 2nd row | Gardner, Alfred L., Curator (USGS), United States Geological Survey (UNITED STATES) |
| 3rd row | Woodman, Neal, (USGS), United States Geological Survey (UNITED STATES) |
| 4th row | Lunde, Darrin P., Collections Manager (MAM), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 5th row | Reeder, DeeAnn M., Bucknell University (UNITED STATES) |
| Value | Count | Frequency (%) |
| states | 8033 | 7.9% |
| united | 8033 | 7.9% |
| of | 5420 | 5.3% |
| museum | 5255 | 5.2% |
| natural | 5077 | 5.0% |
| history | 5077 | 5.0% |
| national | 5064 | 5.0% |
| smithsonian | 5007 | 4.9% |
| institution | 5007 | 4.9% |
| 4859 | 4.8% | |
| Other values (272) | 44753 |
Most occurring characters
| Value | Count | Frequency (%) |
| 93401 | 12.1% | |
| t | 49895 | 6.5% |
| o | 47659 | 6.2% |
| i | 45409 | 5.9% |
| a | 41696 | 5.4% |
| e | 39504 | 5.1% |
| n | 38647 | 5.0% |
| s | 36580 | 4.7% |
| r | 29451 | 3.8% |
| u | 25575 | 3.3% |
| Other values (48) | 324494 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 444828 | |
| Uppercase Letter | 174361 | 22.6% |
| Space Separator | 93401 | 12.1% |
| Other Punctuation | 28613 | 3.7% |
| Open Punctuation | 13070 | 1.7% |
| Close Punctuation | 13070 | 1.7% |
| Dash Punctuation | 4968 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 49895 | |
| o | 47659 | |
| i | 45409 | |
| a | 41696 | |
| e | 39504 | |
| n | 38647 | |
| s | 36580 | |
| r | 29451 | |
| u | 25575 | 5.7% |
| l | 25043 | 5.6% |
| Other values (15) | 65369 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 24263 | |
| T | 20751 | |
| M | 20290 | |
| N | 18192 | |
| E | 15013 | |
| A | 13295 | |
| I | 12131 | |
| U | 9985 | |
| D | 9079 | 5.2% |
| H | 8379 | 4.8% |
| Other values (14) | 22983 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 22186 | |
| . | 6353 | 22.2% |
| ' | 69 | 0.2% |
| ; | 4 | < 0.1% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 93401 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13070 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13070 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4968 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 619189 | |
| Common | 153122 | 19.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 49895 | 8.1% |
| o | 47659 | 7.7% |
| i | 45409 | 7.3% |
| a | 41696 | 6.7% |
| e | 39504 | 6.4% |
| n | 38647 | 6.2% |
| s | 36580 | 5.9% |
| r | 29451 | 4.8% |
| u | 25575 | 4.1% |
| l | 25043 | 4.0% |
| Other values (39) | 239730 |
Common
| Value | Count | Frequency (%) |
| 93401 | ||
| , | 22186 | 14.5% |
| ( | 13070 | 8.5% |
| ) | 13070 | 8.5% |
| . | 6353 | 4.1% |
| - | 4968 | 3.2% |
| ' | 69 | < 0.1% |
| ; | 4 | < 0.1% |
| & | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 772311 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 93401 | 12.1% | |
| t | 49895 | 6.5% |
| o | 47659 | 6.2% |
| i | 45409 | 5.9% |
| a | 41696 | 5.4% |
| e | 39504 | 5.1% |
| n | 38647 | 5.0% |
| s | 36580 | 4.7% |
| r | 29451 | 3.8% |
| u | 25575 | 3.3% |
| Other values (48) | 324494 |
scientificName
Text
| Distinct | 7805 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.6 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 43 |
| Mean length | 22.61255364 |
| Min length | 5 |
Unique
| Unique | 898 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Potos flavus |
|---|---|
| 2nd row | Microtus longicaudus longicaudus |
| 3rd row | Carollia brevicauda |
| 4th row | Peromyscus mexicanus totontepecus |
| 5th row | Tursiops truncatus |
| Value | Count | Frequency (%) |
| peromyscus | 38753 | 2.6% |
| sp | 28343 | 1.9% |
| rattus | 21929 | 1.5% |
| microtus | 19877 | 1.3% |
| maniculatus | 15880 | 1.1% |
| sorex | 15831 | 1.1% |
| artibeus | 12470 | 0.8% |
| carollia | 12281 | 0.8% |
| tursiops | 11895 | 0.8% |
| truncatus | 11875 | 0.8% |
| Other values (5505) | 1302266 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 1517215 | 11.2% |
| i | 1187099 | 8.7% |
| a | 1082276 | 8.0% |
| u | 980723 | 7.2% |
| o | 902387 | 6.6% |
| 889949 | 6.5% | |
| e | 862255 | 6.3% |
| r | 848292 | 6.2% |
| n | 665623 | 4.9% |
| l | 634731 | 4.7% |
| Other values (53) | 4029793 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12079597 | |
| Space Separator | 889949 | 6.5% |
| Uppercase Letter | 601771 | 4.4% |
| Other Punctuation | 28356 | 0.2% |
| Open Punctuation | 313 | < 0.1% |
| Close Punctuation | 313 | < 0.1% |
| Decimal Number | 44 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1517215 | |
| i | 1187099 | |
| a | 1082276 | 9.0% |
| u | 980723 | 8.1% |
| o | 902387 | 7.5% |
| e | 862255 | 7.1% |
| r | 848292 | 7.0% |
| n | 665623 | 5.5% |
| l | 634731 | 5.3% |
| t | 618435 | 5.1% |
| Other values (16) | 2780561 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 103156 | |
| P | 84557 | |
| C | 58907 | |
| S | 54594 | |
| T | 51645 | |
| A | 32571 | 5.4% |
| R | 31119 | 5.2% |
| G | 28180 | 4.7% |
| L | 23175 | 3.9% |
| N | 23069 | 3.8% |
| Other values (14) | 110798 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 13 | |
| 1 | 12 | |
| 2 | 7 | |
| 9 | 6 | |
| 5 | 3 | 6.8% |
| 0 | 2 | 4.5% |
| 4 | 1 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 28343 | |
| , | 11 | < 0.1% |
| / | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 889949 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 313 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 313 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12681368 | |
| Common | 918975 | 6.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1517215 | |
| i | 1187099 | 9.4% |
| a | 1082276 | 8.5% |
| u | 980723 | 7.7% |
| o | 902387 | 7.1% |
| e | 862255 | 6.8% |
| r | 848292 | 6.7% |
| n | 665623 | 5.2% |
| l | 634731 | 5.0% |
| t | 618435 | 4.9% |
| Other values (40) | 3382332 |
Common
| Value | Count | Frequency (%) |
| 889949 | ||
| . | 28343 | 3.1% |
| ( | 313 | < 0.1% |
| ) | 313 | < 0.1% |
| 8 | 13 | < 0.1% |
| 1 | 12 | < 0.1% |
| , | 11 | < 0.1% |
| 2 | 7 | < 0.1% |
| 9 | 6 | < 0.1% |
| 5 | 3 | < 0.1% |
| Other values (3) | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13600343 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 1517215 | 11.2% |
| i | 1187099 | 8.7% |
| a | 1082276 | 8.0% |
| u | 980723 | 7.2% |
| o | 902387 | 6.6% |
| 889949 | 6.5% | |
| e | 862255 | 6.3% |
| r | 848292 | 6.2% |
| n | 665623 | 4.9% |
| l | 634731 | 4.7% |
| Other values (53) | 4029793 |
| Distinct | 253 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 121 |
|---|---|
| Median length | 113 |
| Mean length | 90.64064651 |
| Min length | 11 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Carnivora, Caniformia, Procyonidae |
|---|---|
| 2nd row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Rodentia, Myomorpha, Cricetidae, Arvicolinae |
| 3rd row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Chiroptera, Phyllostomidae, Carolliinae |
| 4th row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Rodentia, Myomorpha, Cricetidae, Neotominae |
| 5th row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Cetacea, Odontoceti, Delphinidae |
| Value | Count | Frequency (%) |
| animalia | 601442 | |
| vertebrata | 601442 | |
| chordata | 601442 | |
| mammalia | 601441 | |
| eutheria | 593341 | |
| rodentia | 297636 | 5.9% |
| myomorpha | 209417 | 4.1% |
| chiroptera | 129086 | 2.5% |
| cricetidae | 107243 | 2.1% |
| muridae | 93911 | 1.9% |
| Other values (328) | 1234181 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8383067 | |
| i | 4797600 | 8.8% |
| , | 4469138 | 8.2% |
| 4469138 | 8.2% | |
| e | 4068524 | 7.5% |
| r | 4037606 | 7.4% |
| t | 3533330 | 6.5% |
| o | 2704288 | 5.0% |
| m | 2453478 | 4.5% |
| h | 1861673 | 3.4% |
| Other values (38) | 13737431 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40506415 | |
| Uppercase Letter | 5070582 | 9.3% |
| Other Punctuation | 4469138 | 8.2% |
| Space Separator | 4469138 | 8.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8383067 | |
| i | 4797600 | |
| e | 4068524 | |
| r | 4037606 | |
| t | 3533330 | |
| o | 2704288 | 6.7% |
| m | 2453478 | 6.1% |
| h | 1861673 | 4.6% |
| n | 1678993 | 4.1% |
| l | 1675363 | 4.1% |
| Other values (14) | 5312493 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1067853 | |
| M | 1065447 | |
| A | 654487 | |
| V | 641586 | |
| E | 615945 | |
| R | 302881 | 6.0% |
| S | 237180 | 4.7% |
| P | 112443 | 2.2% |
| D | 65158 | 1.3% |
| N | 62146 | 1.2% |
| Other values (12) | 245456 | 4.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4469138 |
Space Separator
| Value | Count | Frequency (%) |
| 4469138 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45576997 | |
| Common | 8938276 | 16.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8383067 | |
| i | 4797600 | |
| e | 4068524 | 8.9% |
| r | 4037606 | 8.9% |
| t | 3533330 | 7.8% |
| o | 2704288 | 5.9% |
| m | 2453478 | 5.4% |
| h | 1861673 | 4.1% |
| n | 1678993 | 3.7% |
| l | 1675363 | 3.7% |
| Other values (36) | 10383075 |
Common
| Value | Count | Frequency (%) |
| , | 4469138 | |
| 4469138 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 54515273 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8383067 | |
| i | 4797600 | 8.8% |
| , | 4469138 | 8.2% |
| 4469138 | 8.2% | |
| e | 4068524 | 7.5% |
| r | 4037606 | 7.4% |
| t | 3533330 | 6.5% |
| o | 2704288 | 5.0% |
| m | 2453478 | 4.5% |
| h | 1861673 | 3.4% |
| Other values (38) | 13737431 |
kingdom
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 601442 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1202884 | |
| a | 1202884 | |
| A | 601442 | |
| n | 601442 | |
| m | 601442 | |
| l | 601442 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4210094 | |
| Uppercase Letter | 601442 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1202884 | |
| a | 1202884 | |
| n | 601442 | |
| m | 601442 | |
| l | 601442 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 601442 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4811536 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1202884 | |
| a | 1202884 | |
| A | 601442 | |
| n | 601442 | |
| m | 601442 | |
| l | 601442 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4811536 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1202884 | |
| a | 1202884 | |
| A | 601442 | |
| n | 601442 | |
| m | 601442 | |
| l | 601442 |
phylum
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Chordata |
| 3rd row | Chordata |
| 4th row | Chordata |
| 5th row | Chordata |
| Value | Count | Frequency (%) |
| chordata | 601442 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1202884 | |
| C | 601442 | |
| h | 601442 | |
| o | 601442 | |
| r | 601442 | |
| d | 601442 | |
| t | 601442 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4210094 | |
| Uppercase Letter | 601442 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1202884 | |
| h | 601442 | |
| o | 601442 | |
| r | 601442 | |
| d | 601442 | |
| t | 601442 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 601442 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4811536 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1202884 | |
| C | 601442 | |
| h | 601442 | |
| o | 601442 | |
| r | 601442 | |
| d | 601442 | |
| t | 601442 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4811536 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1202884 | |
| C | 601442 | |
| h | 601442 | |
| o | 601442 | |
| r | 601442 | |
| d | 601442 | |
| t | 601442 |
class
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mammalia |
|---|---|
| 2nd row | Mammalia |
| 3rd row | Mammalia |
| 4th row | Mammalia |
| 5th row | Mammalia |
| Value | Count | Frequency (%) |
| mammalia | 601441 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1804323 | |
| m | 1202882 | |
| M | 601441 | 12.5% |
| l | 601441 | 12.5% |
| i | 601441 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4210087 | |
| Uppercase Letter | 601441 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1804323 | |
| m | 1202882 | |
| l | 601441 | 14.3% |
| i | 601441 | 14.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 601441 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4811528 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1804323 | |
| m | 1202882 | |
| M | 601441 | 12.5% |
| l | 601441 | 12.5% |
| i | 601441 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4811528 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1804323 | |
| m | 1202882 | |
| M | 601441 | 12.5% |
| l | 601441 | 12.5% |
| i | 601441 | 12.5% |
order
Text
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 8 |
| Mean length | 8.868953064 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Carnivora |
|---|---|
| 2nd row | Rodentia |
| 3rd row | Chiroptera |
| 4th row | Rodentia |
| 5th row | Cetacea |
| Value | Count | Frequency (%) |
| rodentia | 297636 | |
| chiroptera | 129086 | |
| cetacea | 47582 | 7.9% |
| carnivora | 47293 | 7.9% |
| soricomorpha | 30383 | 5.1% |
| lagomorpha | 11977 | 2.0% |
| artiodactyla | 11375 | 1.9% |
| primates | 10781 | 1.8% |
| didelphimorphia | 5643 | 0.9% |
| diprotodontia | 1652 | 0.3% |
| Other values (19) | 8033 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 725642 | |
| o | 618973 | |
| i | 555649 | |
| e | 546091 | |
| t | 514232 | |
| r | 462546 | |
| n | 351517 | |
| d | 320912 | |
| R | 297636 | |
| C | 224380 | 4.2% |
| Other values (22) | 716574 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4732711 | |
| Uppercase Letter | 601441 | 11.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 725642 | |
| o | 618973 | |
| i | 555649 | |
| e | 546091 | |
| t | 514232 | |
| r | 462546 | |
| n | 351517 | |
| d | 320912 | |
| p | 186076 | 3.9% |
| h | 184413 | 3.9% |
| Other values (10) | 266660 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 297636 | |
| C | 224380 | |
| S | 32307 | 5.4% |
| P | 12509 | 2.1% |
| A | 12049 | 2.0% |
| L | 11977 | 2.0% |
| D | 7784 | 1.3% |
| M | 1503 | 0.2% |
| E | 940 | 0.2% |
| H | 341 | 0.1% |
| Other values (2) | 15 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5334152 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 725642 | |
| o | 618973 | |
| i | 555649 | |
| e | 546091 | |
| t | 514232 | |
| r | 462546 | |
| n | 351517 | |
| d | 320912 | |
| R | 297636 | |
| C | 224380 | 4.2% |
| Other values (22) | 716574 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5334152 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 725642 | |
| o | 618973 | |
| i | 555649 | |
| e | 546091 | |
| t | 514232 | |
| r | 462546 | |
| n | 351517 | |
| d | 320912 | |
| R | 297636 | |
| C | 224380 | 4.2% |
| Other values (22) | 716574 |
family
Text
| Distinct | 153 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 10.23417143 |
| Min length | 6 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Procyonidae |
|---|---|
| 2nd row | Cricetidae |
| 3rd row | Phyllostomidae |
| 4th row | Cricetidae |
| 5th row | Delphinidae |
| Value | Count | Frequency (%) |
| cricetidae | 107243 | |
| muridae | 93911 | |
| phyllostomidae | 55530 | 9.2% |
| sciuridae | 46130 | 7.7% |
| soricidae | 27470 | 4.6% |
| vespertilionidae | 25753 | 4.3% |
| delphinidae | 23642 | 3.9% |
| heteromyidae | 19997 | 3.3% |
| molossidae | 13560 | 2.3% |
| canidae | 12559 | 2.1% |
| Other values (143) | 175649 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 949239 | |
| i | 924898 | |
| a | 664049 | |
| d | 634246 | |
| r | 409806 | 6.7% |
| o | 348989 | 5.7% |
| t | 274918 | 4.5% |
| l | 232701 | 3.8% |
| c | 221432 | 3.6% |
| u | 159979 | 2.6% |
| Other values (32) | 1335024 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5553837 | |
| Uppercase Letter | 601444 | 9.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 949239 | |
| i | 924898 | |
| a | 664049 | |
| d | 634246 | |
| r | 409806 | |
| o | 348989 | 6.3% |
| t | 274918 | 5.0% |
| l | 232701 | 4.2% |
| c | 221432 | 4.0% |
| u | 159979 | 2.9% |
| Other values (12) | 733580 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 134035 | |
| M | 130155 | |
| P | 81969 | |
| S | 74875 | |
| D | 35147 | 5.8% |
| V | 26962 | 4.5% |
| H | 26673 | 4.4% |
| B | 14305 | 2.4% |
| G | 12230 | 2.0% |
| E | 11823 | 2.0% |
| Other values (10) | 53270 | 8.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6155281 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 949239 | |
| i | 924898 | |
| a | 664049 | |
| d | 634246 | |
| r | 409806 | 6.7% |
| o | 348989 | 5.7% |
| t | 274918 | 4.5% |
| l | 232701 | 3.8% |
| c | 221432 | 3.6% |
| u | 159979 | 2.6% |
| Other values (32) | 1335024 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6155281 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 949239 | |
| i | 924898 | |
| a | 664049 | |
| d | 634246 | |
| r | 409806 | 6.7% |
| o | 348989 | 5.7% |
| t | 274918 | 4.5% |
| l | 232701 | 3.8% |
| c | 221432 | 3.6% |
| u | 159979 | 2.6% |
| Other values (32) | 1335024 |
genus
Text
| Distinct | 1136 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 16 |
| Missing (%) | < 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 8.505181774 |
| Min length | 2 |
Unique
| Unique | 72 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Potos |
|---|---|
| 2nd row | Microtus |
| 3rd row | Carollia |
| 4th row | Peromyscus |
| 5th row | Tursiops |
| Value | Count | Frequency (%) |
| peromyscus | 38753 | 6.4% |
| microtus | 19877 | 3.3% |
| rattus | 16463 | 2.7% |
| sorex | 15826 | 2.6% |
| artibeus | 12470 | 2.1% |
| carollia | 12281 | 2.0% |
| tursiops | 11894 | 2.0% |
| tamias | 11871 | 2.0% |
| mastomys | 11447 | 1.9% |
| mus | 10554 | 1.8% |
| Other values (1126) | 439999 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 603660 | 11.8% |
| o | 513897 | 10.0% |
| a | 349432 | 6.8% |
| r | 348676 | 6.8% |
| u | 336000 | 6.6% |
| i | 331710 | 6.5% |
| e | 315840 | 6.2% |
| t | 247363 | 4.8% |
| l | 220400 | 4.3% |
| m | 215951 | 4.2% |
| Other values (40) | 1632385 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4513879 | |
| Uppercase Letter | 601435 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 603660 | |
| o | 513897 | |
| a | 349432 | 7.7% |
| r | 348676 | 7.7% |
| u | 336000 | 7.4% |
| i | 331710 | 7.3% |
| e | 315840 | 7.0% |
| t | 247363 | 5.5% |
| l | 220400 | 4.9% |
| m | 215951 | 4.8% |
| Other values (16) | 1030950 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 102970 | |
| P | 84552 | |
| C | 58805 | |
| S | 54592 | |
| T | 51642 | |
| A | 32571 | 5.4% |
| R | 31119 | 5.2% |
| G | 28178 | 4.7% |
| L | 23171 | 3.9% |
| N | 23063 | 3.8% |
| Other values (14) | 110772 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5115314 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 603660 | 11.8% |
| o | 513897 | 10.0% |
| a | 349432 | 6.8% |
| r | 348676 | 6.8% |
| u | 336000 | 6.6% |
| i | 331710 | 6.5% |
| e | 315840 | 6.2% |
| t | 247363 | 4.8% |
| l | 220400 | 4.3% |
| m | 215951 | 4.2% |
| Other values (40) | 1632385 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5115314 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 603660 | 11.8% |
| o | 513897 | 10.0% |
| a | 349432 | 6.8% |
| r | 348676 | 6.8% |
| u | 336000 | 6.6% |
| i | 331710 | 6.5% |
| e | 315840 | 6.2% |
| t | 247363 | 4.8% |
| l | 220400 | 4.3% |
| m | 215951 | 4.2% |
| Other values (40) | 1632385 |
subgenus
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 601149 |
| Missing (%) | 99.9% |
| Memory size | 4.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 10.82781457 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mallodelphys |
|---|---|
| 2nd row | Eumarmosa |
| 3rd row | Caluromys |
| 4th row | Caluromys |
| 5th row | Eumarmosa |
| Value | Count | Frequency (%) |
| mallodelphys | 184 | |
| caluromys | 95 | |
| eumarmosa | 23 | 7.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 647 | |
| a | 325 | |
| o | 302 | |
| s | 302 | |
| y | 279 | |
| M | 184 | 5.6% |
| d | 184 | 5.6% |
| e | 184 | 5.6% |
| p | 184 | 5.6% |
| h | 184 | 5.6% |
| Other values (5) | 495 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2968 | |
| Uppercase Letter | 302 | 9.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 647 | |
| a | 325 | |
| o | 302 | |
| s | 302 | |
| y | 279 | |
| d | 184 | 6.2% |
| e | 184 | 6.2% |
| p | 184 | 6.2% |
| h | 184 | 6.2% |
| m | 141 | 4.8% |
| Other values (2) | 236 | 8.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 184 | |
| C | 95 | |
| E | 23 | 7.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3270 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 647 | |
| a | 325 | |
| o | 302 | |
| s | 302 | |
| y | 279 | |
| M | 184 | 5.6% |
| d | 184 | 5.6% |
| e | 184 | 5.6% |
| p | 184 | 5.6% |
| h | 184 | 5.6% |
| Other values (5) | 495 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3270 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 647 | |
| a | 325 | |
| o | 302 | |
| s | 302 | |
| y | 279 | |
| M | 184 | 5.6% |
| d | 184 | 5.6% |
| e | 184 | 5.6% |
| p | 184 | 5.6% |
| h | 184 | 5.6% |
| Other values (5) | 495 |
specificEpithet
Text
| Distinct | 2774 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 678 |
| Missing (%) | 0.1% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 14 |
| Mean length | 8.402236785 |
| Min length | 2 |
Unique
| Unique | 260 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | flavus |
|---|---|
| 2nd row | longicaudus |
| 3rd row | brevicauda |
| 4th row | mexicanus |
| 5th row | truncatus |
| Value | Count | Frequency (%) |
| sp | 28335 | 4.7% |
| maniculatus | 15647 | 2.6% |
| truncatus | 11873 | 2.0% |
| musculus | 8553 | 1.4% |
| perspicillata | 8339 | 1.4% |
| leucopus | 7382 | 1.2% |
| brevicauda | 7356 | 1.2% |
| pennsylvanicus | 6840 | 1.1% |
| jamaicensis | 5581 | 0.9% |
| rattus | 5466 | 0.9% |
| Other values (2764) | 495405 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 600065 | |
| i | 553121 | |
| a | 505496 | |
| u | 460127 | |
| e | 329136 | 6.5% |
| r | 328084 | 6.5% |
| n | 326046 | 6.5% |
| l | 286917 | 5.7% |
| t | 270410 | 5.4% |
| c | 259844 | 5.1% |
| Other values (19) | 1128591 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5019496 | |
| Other Punctuation | 28337 | 0.6% |
| Space Separator | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 600065 | |
| i | 553121 | |
| a | 505496 | |
| u | 460127 | |
| e | 329136 | 6.6% |
| r | 328084 | 6.5% |
| n | 326046 | 6.5% |
| l | 286917 | 5.7% |
| t | 270410 | 5.4% |
| c | 259844 | 5.2% |
| Other values (16) | 1100250 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 28335 | |
| / | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5019496 | |
| Common | 28341 | 0.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 600065 | |
| i | 553121 | |
| a | 505496 | |
| u | 460127 | |
| e | 329136 | 6.6% |
| r | 328084 | 6.5% |
| n | 326046 | 6.5% |
| l | 286917 | 5.7% |
| t | 270410 | 5.4% |
| c | 259844 | 5.2% |
| Other values (16) | 1100250 |
Common
| Value | Count | Frequency (%) |
| . | 28335 | |
| 4 | < 0.1% | |
| / | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5047837 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 600065 | |
| i | 553121 | |
| a | 505496 | |
| u | 460127 | |
| e | 329136 | 6.5% |
| r | 328084 | 6.5% |
| n | 326046 | 6.5% |
| l | 286917 | 5.7% |
| t | 270410 | 5.4% |
| c | 259844 | 5.1% |
| Other values (19) | 1128591 |
Missing 
| Distinct | 2646 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 314922 |
| Missing (%) | 52.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 15 |
| Mean length | 8.827504371 |
| Min length | 3 |
Unique
| Unique | 227 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | longicaudus |
|---|---|
| 2nd row | totontepecus |
| 3rd row | marinensis |
| 4th row | bairdii |
| 5th row | merriami |
| Value | Count | Frequency (%) |
| noveboracensis | 4836 | 1.7% |
| domesticus | 4357 | 1.5% |
| pennsylvanicus | 4127 | 1.4% |
| talpoides | 3712 | 1.3% |
| cinereus | 3602 | 1.3% |
| sonoriensis | 2279 | 0.8% |
| gambelii | 2247 | 0.8% |
| trowbridgii | 2145 | 0.7% |
| merriami | 2101 | 0.7% |
| longicaudus | 2081 | 0.7% |
| Other values (2636) | 255042 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 311070 | |
| i | 300653 | |
| a | 225909 | |
| e | 216894 | |
| n | 194319 | 7.7% |
| u | 182648 | 7.2% |
| r | 170773 | 6.8% |
| o | 145249 | 5.7% |
| l | 125981 | 5.0% |
| c | 119143 | 4.7% |
| Other values (16) | 536697 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2529336 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 311070 | |
| i | 300653 | |
| a | 225909 | |
| e | 216894 | |
| n | 194319 | 7.7% |
| u | 182648 | 7.2% |
| r | 170773 | 6.8% |
| o | 145249 | 5.7% |
| l | 125981 | 5.0% |
| c | 119143 | 4.7% |
| Other values (16) | 536697 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2529336 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 311070 | |
| i | 300653 | |
| a | 225909 | |
| e | 216894 | |
| n | 194319 | 7.7% |
| u | 182648 | 7.2% |
| r | 170773 | 6.8% |
| o | 145249 | 5.7% |
| l | 125981 | 5.0% |
| c | 119143 | 4.7% |
| Other values (16) | 536697 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2529336 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 311070 | |
| i | 300653 | |
| a | 225909 | |
| e | 216894 | |
| n | 194319 | 7.7% |
| u | 182648 | 7.2% |
| r | 170773 | 6.8% |
| o | 145249 | 5.7% |
| l | 125981 | 5.0% |
| c | 119143 | 4.7% |
| Other values (16) | 536697 |
taxonRank
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 314922 |
| Missing (%) | 52.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | subspecies |
|---|---|
| 2nd row | subspecies |
| 3rd row | subspecies |
| 4th row | subspecies |
| 5th row | subspecies |
| Value | Count | Frequency (%) |
| subspecies | 286529 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 859587 | |
| e | 573058 | |
| u | 286529 | 10.0% |
| b | 286529 | 10.0% |
| p | 286529 | 10.0% |
| c | 286529 | 10.0% |
| i | 286529 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2865290 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 859587 | |
| e | 573058 | |
| u | 286529 | 10.0% |
| b | 286529 | 10.0% |
| p | 286529 | 10.0% |
| c | 286529 | 10.0% |
| i | 286529 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2865290 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 859587 | |
| e | 573058 | |
| u | 286529 | 10.0% |
| b | 286529 | 10.0% |
| p | 286529 | 10.0% |
| c | 286529 | 10.0% |
| i | 286529 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2865290 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 859587 | |
| e | 573058 | |
| u | 286529 | 10.0% |
| b | 286529 | 10.0% |
| p | 286529 | 10.0% |
| c | 286529 | 10.0% |
| i | 286529 | 10.0% |
Missing 
| Distinct | 176 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 555607 |
| Missing (%) | 92.4% |
| Memory size | 4.6 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 8.940755606 |
| Min length | 4 |
Unique
| Unique | 69 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | (Montagu) |
|---|---|
| 2nd row | (Montagu) |
| 3rd row | (Linnaeus) |
| 4th row | (Cuvier) |
| 5th row | Stejneger |
| Value | Count | Frequency (%) |
| linnaeus | 14516 | |
| montagu | 11845 | |
| gray | 4024 | 8.2% |
| cuvier | 2015 | 4.1% |
| de | 1265 | 2.6% |
| blainville | 1263 | 2.6% |
| traill | 1101 | 2.2% |
| true | 1018 | 2.1% |
| lacepede | 1006 | 2.0% |
| lilljeborg | 934 | 1.9% |
| Other values (159) | 10239 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 46261 | |
| ( | 37392 | 9.1% |
| ) | 37392 | 9.1% |
| a | 37146 | 9.1% |
| e | 31946 | 7.8% |
| u | 30372 | 7.4% |
| i | 24799 | 6.1% |
| s | 19879 | 4.8% |
| L | 17196 | 4.2% |
| o | 17121 | 4.2% |
| Other values (45) | 110376 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 283189 | |
| Uppercase Letter | 47324 | 11.5% |
| Open Punctuation | 37392 | 9.1% |
| Close Punctuation | 37392 | 9.1% |
| Space Separator | 3382 | 0.8% |
| Other Punctuation | 1198 | 0.3% |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 46261 | |
| a | 37146 | |
| e | 31946 | |
| u | 30372 | |
| i | 24799 | |
| s | 19879 | |
| o | 17121 | 6.0% |
| r | 14853 | 5.2% |
| g | 13602 | 4.8% |
| t | 13339 | 4.7% |
| Other values (16) | 33871 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 17196 | |
| M | 13479 | |
| G | 5210 | 11.0% |
| B | 2551 | 5.4% |
| T | 2132 | 4.5% |
| C | 2021 | 4.3% |
| O | 1040 | 2.2% |
| P | 767 | 1.6% |
| F | 755 | 1.6% |
| D | 616 | 1.3% |
| Other values (12) | 1557 | 3.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 683 | |
| ' | 426 | |
| . | 89 | 7.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 37392 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37392 |
Space Separator
| Value | Count | Frequency (%) |
| 3382 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 330513 | |
| Common | 79367 | 19.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 46261 | |
| a | 37146 | |
| e | 31946 | |
| u | 30372 | 9.2% |
| i | 24799 | 7.5% |
| s | 19879 | 6.0% |
| L | 17196 | 5.2% |
| o | 17121 | 5.2% |
| r | 14853 | 4.5% |
| g | 13602 | 4.1% |
| Other values (38) | 77338 |
Common
| Value | Count | Frequency (%) |
| ( | 37392 | |
| ) | 37392 | |
| 3382 | 4.3% | |
| & | 683 | 0.9% |
| ' | 426 | 0.5% |
| . | 89 | 0.1% |
| - | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 409880 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 46261 | |
| ( | 37392 | 9.1% |
| ) | 37392 | 9.1% |
| a | 37146 | 9.1% |
| e | 31946 | 7.8% |
| u | 30372 | 7.4% |
| i | 24799 | 6.1% |
| s | 19879 | 4.8% |
| L | 17196 | 4.2% |
| o | 17121 | 4.2% |
| Other values (45) | 110376 |